A system for detecting professional skills from resumes written in natural language

Bibliographic Details
Published in: 2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 189-196
Main Authors: Chifu, Emil St, Chifu, Viorica Rozina, Popa, Iulia, Salomie, Ioan
Format: Conference Proceeding
Language: English
Published: IEEE 01-09-2017
Abstract: In this paper, we present a new method for detecting professional skills (as noun phrases) in resumes written in natural language. The proposed method uses an ontology of skills, the Wikipedia encyclopedia, and a set of standard multi-word part-of-speech patterns to detect the professional skills. First, the method checks whether the text of the resumes contains skills that are concepts in our ontology. The method also tries to identify possible new skills that are not present in our ontology. This is done with the help of specific, lexicalized, multi-word expression patterns (i.e., specific contexts) that could surround new, unknown skills. These specific contexts are induced by training on a corpus of resumes. The induction is based on a set of standard, generic part-of-speech patterns (found by hand) that usually contain the skills already present in the ontology. Hence, our skill extraction method is based on a bootstrapping approach. The newly detected skills are validated by a human expert and then inserted automatically into the skill ontology. Populating the ontology with the new skills is performed with the help of the Wikipedia encyclopedia. The proposed method has been tested on a set of resumes written by users as well as on a corpus collected by automatically extracting resumes from specific Web sites.
Author Salomie, Ioan
Chifu, Emil St
Chifu, Viorica Rozina
Popa, Iulia
Author_xml – sequence: 1
  givenname: Emil St
  surname: Chifu
  fullname: Chifu, Emil St
  email: emil.chifu@cs.utcluj.ro
  organization: Dept. of Comput. Sci., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
– sequence: 2
  givenname: Viorica Rozina
  surname: Chifu
  fullname: Chifu, Viorica Rozina
  email: viorica.chifu@cs.utcluj.ro
  organization: Dept. of Comput. Sci., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
– sequence: 3
  givenname: Iulia
  surname: Popa
  fullname: Popa, Iulia
  organization: Dept. of Comput. Sci., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
– sequence: 4
  givenname: Ioan
  surname: Salomie
  fullname: Salomie, Ioan
  email: iona.salomie@cs.utcluj.ro
  organization: Dept. of Comput. Sci., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICCP.2017.8117003
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library Online
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library Online
  url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781538633687
153863368X
EndPage 196
ExternalDocumentID 8117003
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
IEDL.DBID RIE
IngestDate Thu Jun 29 18:37:33 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
PageCount 8
ParticipantIDs ieee_primary_8117003
PublicationCentury 2000
PublicationDate 2017-Sept.
PublicationDateYYYYMMDD 2017-09-01
PublicationDate_xml – month: 09
  year: 2017
  text: 2017-Sept.
PublicationDecade 2010
PublicationTitle 2017 13th IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)
PublicationTitleAbbrev ICCP
PublicationYear 2017
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.752645
SourceID ieee
SourceType Publisher
StartPage 189
SubjectTerms Data mining
Hidden Markov models
Natural languages
Ontologies
Resumes
Training
Title A system for detecting professional skills from resumes written in natural language
URI https://ieeexplore.ieee.org/document/8117003