Extract information about education institute, grades, year and degree from text using NLP in Python

Question

I want to extract information about education institute, degree, year of passing and grades (CGPA/GPA/Percentage) from text using NLP in Python. For example, if I have the input:

NBN Sinhgad School Of Engineering,Pune 2016 - 2020 Bachelor of Engineering Computer Science CGPA: 8.78 Vidya Bharati Chinmaya Vidyalaya,Jamshedpur 2014 - 2016 Intermediate-PCM,Economics CBSE Percentage: 88.8 Vidya Bharati Chinmaya Vidyalaya,Jamshedpur 2003 - 2014 Matriculation,CBSE CGPA: 8.6 EXPERIENCE

I want the ouput:

[{
  "Institute": "NBN Sinhgad School Of Engineering",
  "Degree": "Bachelor of Engineering Computer Science",
  "Grades": "8.78",
  "Year of Passing": "2020"
}, {
  "Institute": "Vidya Bharati Chinmaya Vidyalaya",
  "Degree": "Intermediate-PCM,Economics",
  "Grades": "88.8",
  "Year of Passing": "2016"
}, {
  "Institute": "Vidya Bharati Chinmaya Vidyalaya",
  "Degree": "Matriculation,CBSE",
  "Grades": "8.6",
  "Year of Passing": "2014"
}]

Can it be done without training any custom NER model? Is there any pre-trained NER available to do this?

Extract information about education institute, grades, year and degree from text using NLP in Python

Answers (1)

Related Questions