Investigation into biomedical literature classification using support vector machines

Bibliographic Details
Published in: 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05), pp. 366-374
Main Authors: Polavarapu, N., Navathe, S.B., Ramnarayanan, R., ul Haque, A., Sahay, S., Liu, Y.
Format: Conference Proceeding, Journal Article
Language: English
Published: United States: IEEE, 2005
Abstract Topic-specific search in the PubMed database, one of the most important information resources for the scientific community, presents a major challenge to users. The researcher typically formulates Boolean queries and then scans the retrieved records for relevance, which is very time-consuming and error-prone. We applied Support Vector Machines (SVM) for automatic retrieval of PubMed articles related to human genome epidemiological research at the CDC (Centers for Disease Control and Prevention). In this paper, we discuss various investigations into biomedical literature classification and analyze the effect of issues related to the choice of keywords, training sets, kernel functions, and parameters for the SVM technique. We report on these factors to show that SVM is a viable technique for automatic classification of biomedical literature into topics of interest such as epidemiology, cancer, and birth defects. In all our experiments, we achieved high values of positive predictive value (PPV), sensitivity, and specificity.
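For reference, the three evaluation measures named in the abstract have standard definitions in terms of confusion-matrix counts. The following is a minimal Python sketch of those definitions only. The function name and the example counts are hypothetical and are not taken from the paper, and the sketch assumes every denominator is nonzero.

```python
def classification_metrics(tp, fp, tn, fn):
    """Return (PPV, sensitivity, specificity) from confusion-matrix counts.

    tp/fp/tn/fn are true-positive, false-positive, true-negative and
    false-negative counts. Assumes each denominator below is nonzero.
    """
    ppv = tp / (tp + fp)            # fraction of retrieved articles that are relevant
    sensitivity = tp / (tp + fn)    # fraction of relevant articles that are retrieved
    specificity = tn / (tn + fp)    # fraction of irrelevant articles correctly rejected
    return ppv, sensitivity, specificity


# Hypothetical counts, chosen only to illustrate the arithmetic.
ppv, sensitivity, specificity = classification_metrics(tp=90, fp=10, tn=180, fn=20)
print(f"PPV={ppv:.3f} sensitivity={sensitivity:.3f} specificity={specificity:.3f}")
```

In a retrieval setting such as the one described, PPV corresponds to precision and sensitivity to recall, so a classifier tuned for one may trade off against the other.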
Author_xml – sequence: 1
  givenname: N.
  surname: Polavarapu
  fullname: Polavarapu, N.
  organization: Sch. of Biol., Georgia Inst. of Technol., Atlanta, GA, USA
– sequence: 2
  givenname: S.B.
  surname: Navathe
  fullname: Navathe, S.B.
– sequence: 3
  givenname: R.
  surname: Ramnarayanan
  fullname: Ramnarayanan, R.
– sequence: 4
  givenname: A.
  surname: ul Haque
  fullname: ul Haque, A.
– sequence: 5
  givenname: S.
  surname: Sahay
  fullname: Sahay, S.
– sequence: 6
  givenname: Y.
  surname: Liu
  fullname: Liu, Y.
BackLink https://www.ncbi.nlm.nih.gov/pubmed/16447994 (View this record in MEDLINE/PubMed)
ContentType Conference Proceeding
Journal Article
DOI 10.1109/CSB.2005.36
Discipline Biology
EndPage 374
ExternalDocumentID 16447994
1498038
Genre orig-research
Evaluation Studies
Journal Article
ISBN 0769523447
9780769523446
ISSN 1551-7497
PMID 16447994
PageCount 9
PublicationTitle 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05)
PublicationTitleAbbrev CSB
PublicationTitleAlternate Proc IEEE Comput Syst Bioinform Conf
PublicationYear 2005
Publisher IEEE
Publisher_xml – name: IEEE
StartPage 366
SubjectTerms Abstracting and Indexing as Topic - methods
Algorithms
Artificial Intelligence
Automatic control
Bioinformatics
Database Management Systems
Diseases
Genomics
Humans
Information resources
Information Storage and Retrieval - methods
Kernel
Natural Language Processing
Pattern Recognition, Automated - methods
Periodicals as Topic
PubMed
Support vector machine classification
Support vector machines
Vocabulary, Controlled
Title Investigation into biomedical literature classification using support vector machines
URI https://ieeexplore.ieee.org/document/1498038
https://www.ncbi.nlm.nih.gov/pubmed/16447994
https://search.proquest.com/docview/70124553