Investigation into biomedical literature classification using support vector machines
Specific topic search in the PubMed Database, one of the most important information resources for scientific community, presents a big challenge to the users. The researcher typically formulates boolean queries followed by scanning the retrieved records for relevance, which is very time consuming an...
Saved in:
Published in: | 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05) pp. 366 - 374 |
---|---|
Main Authors: | , , , , , |
Format: | Conference Proceeding Journal Article |
Language: | English |
Published: |
United States
IEEE
2005
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | Specific topic search in the PubMed Database, one of the most important information resources for scientific community, presents a big challenge to the users. The researcher typically formulates boolean queries followed by scanning the retrieved records for relevance, which is very time consuming and error prone. We applied Support Vector Machines (SVM) for automatic retrieval of PubMed articles related to Human genome epidemiological research at CDC (Center for disease Control and Prevention). In this paper, we discuss various investigations into biomedical literature classification and analyze the effect of various issues related to the choice of keywords, training sets, kernel functions and parameters for the SVM technique. We report on the various factors above to show that SVM is a viable technique for automatic classification of biomedical literature into topics of interest such as epidemiology, cancer, birth defects etc. In all our experiments, we achieved high values of PPV, sensitivity and specificity. |
---|---|
AbstractList | Specific topic search in the PubMed Database, one of the most important information resources for scientific community, presents a big challenge to the users. The researcher typically formulates boolean queries followed by scanning the retrieved records for relevance, which is very time consuming and error prone. We applied Support Vector Machines (SVM) for automatic retrieval of PubMed articles related to Human genome epidemiological research at CDC (Center for disease Control and Prevention). In this paper, we discuss various investigations into biomedical literature classification and analyze the effect of various issues related to the choice of keywords, training sets, kernel functions and parameters for the SVM technique. We report on the various factors above to show that SVM is a viable technique for automatic classification of biomedical literature into topics of interest such as epidemiology, cancer, birth defects etc. In all our experiments, we achieved high values of PPV, sensitivity and specificity. |
Author | Ramnarayanan, R. Polavarapu, N. Liu, Y. Navathe, S.B. Sahay, S. ul Haque, A. |
Author_xml | – sequence: 1 givenname: N. surname: Polavarapu fullname: Polavarapu, N. organization: Sch. of Biol., Georgia Inst. of Technol., Atlanta, GA, USA – sequence: 2 givenname: S.B. surname: Navathe fullname: Navathe, S.B. – sequence: 3 givenname: R. surname: Ramnarayanan fullname: Ramnarayanan, R. – sequence: 4 givenname: A. surname: ul Haque fullname: ul Haque, A. – sequence: 5 givenname: S. surname: Sahay fullname: Sahay, S. – sequence: 6 givenname: Y. surname: Liu fullname: Liu, Y. |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/16447994$$D View this record in MEDLINE/PubMed |
BookMark | eNpF0M9LwzAUB_CAE_fDnTwKkpO3zqRNmuSowx-DgQfduaRpMiNtUpt04H9vcBPf5cF7Hx583xxMnHcagCuMVhgjcbd-e1jlCNFVUZ6BOWKloHlBCJuAGaYUZ4wINgXLED5RKkIxweQCTHGZjBBkBnYbd9Ah2r2M1jtoXfSwtr7TjVWyha2NepBxHDRUrQzBmjT-lWOwbg_D2Pd-iPCgVfQD7KT6sE6HS3BuZBv08tQXYPf0-L5-ybavz5v1_TazeS5iZhBhvCGmIUIho1QjRNloIySn2FCOMS9rjkiOhNQIM05Nw4sasbSnZa1osQC3x7v94L_GlKPqbFC6baXTfgwVQzgnlBYJ3pzgWKdsVT_YTg7f1d8jErg-Aqu1_l8TwVHBix8iYGwH |
ContentType | Conference Proceeding Journal Article |
DBID | 6IE 6IL CBEJK RIE RIL CGR CUY CVF ECM EIF NPM 7X8 |
DOI | 10.1109/CSB.2005.36 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library Online IEEE Proceedings Order Plans (POP All) 1998-Present Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
DatabaseTitle | MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
DatabaseTitleList | MEDLINE |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library Online url: http://ieeexplore.ieee.org/Xplore/DynWel.jsp sourceTypes: Publisher – sequence: 2 dbid: ECM name: MEDLINE url: https://search.ebscohost.com/login.aspx?direct=true&db=cmedm&site=ehost-live sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EndPage | 374 |
ExternalDocumentID | 16447994 1498038 |
Genre | orig-research Evaluation Studies Journal Article |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AARBI ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK OCL RIE RIL 29O ADZIZ CGR CHZPO CUY CVF ECM EIF IPLJI NPM RNS 7X8 |
ID | FETCH-LOGICAL-i229t-f0478d4fd49c0fccd996def9a851f581186b804209ae01785fd83b07a8556bc53 |
IEDL.DBID | RIE |
ISBN | 0769523447 9780769523446 |
ISSN | 1551-7497 |
IngestDate | Fri Apr 12 11:23:59 EDT 2024 Thu May 23 23:09:42 EDT 2024 Wed Jun 26 19:27:02 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i229t-f0478d4fd49c0fccd996def9a851f581186b804209ae01785fd83b07a8556bc53 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
PMID | 16447994 |
PQID | 70124553 |
PQPubID | 23479 |
PageCount | 9 |
ParticipantIDs | proquest_miscellaneous_70124553 pubmed_primary_16447994 ieee_primary_1498038 |
PublicationCentury | 2000 |
PublicationDate | 20050000 2005-00-00 20050101 |
PublicationDateYYYYMMDD | 2005-01-01 |
PublicationDate_xml | – year: 2005 text: 20050000 |
PublicationDecade | 2000 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | 2005 IEEE Computational Systems Bioinformatics Conference (CSB'05) |
PublicationTitleAbbrev | CSB |
PublicationTitleAlternate | Proc IEEE Comput Syst Bioinform Conf |
PublicationYear | 2005 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0000451414 ssj0039284 |
Score | 1.4289409 |
Snippet | Specific topic search in the PubMed Database, one of the most important information resources for scientific community, presents a big challenge to the users.... |
SourceID | proquest pubmed ieee |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 366 |
SubjectTerms | Abstracting and Indexing as Topic - methods Algorithms Artificial Intelligence Automatic control Bioinformatics Database Management Systems Diseases Genomics Humans Information resources Information Storage and Retrieval - methods Kernel Natural Language Processing Pattern Recognition, Automated - methods Periodicals as Topic PubMed Support vector machine classification Support vector machines Vocabulary, Controlled |
Title | Investigation into biomedical literature classification using support vector machines |
URI | https://ieeexplore.ieee.org/document/1498038 https://www.ncbi.nlm.nih.gov/pubmed/16447994 https://search.proquest.com/docview/70124553 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELVoJyZALVA-PTASmsR2bK-UVp0QUqnEFsUfQZUgqUjT34_PSVMGGNgSxYmis61773z3DqG7PIpIFksVKBKzANQIg8wyHiSWaE0I14mGgNt8wZ_fxNMUZHLuu1oYa61PPrMPcOnP8k2pawiVjR2aFyERPdTjUjS1Wl08BXRSqO_i5Ji5dPSKUt4K7Ozuk7Y-LwrleLJ4bAIqoM3s-6r8DTG9q5kd_e8nj9FwX7OHXzpvdIIObDFAyx8yGmWBV8WmxE3BPcwN_ug0lbEGFA1pQ81ISId_x1W9BniOtz60jz994qWthmg5m75O5kHbSCFYxbHcBDlI8BiaGyp1mGttHMkxNpeZg1s5E45jJEq43RvKzLodKlhuBFEhd89ZojQjp6hflIU9RzgSMqJacSUJo8Q6tudmNeGZo1EmVNKO0ABMkq4brYy0tcYI3e6Mm7r1C4cSWWHLukq585CUMTJCZ43N9686qMalpBe_f_ISHXoZVR8OuUL9zVdtr1GvMvWNXxnfLJa2Sg |
link.rule.ids | 310,311,315,782,786,791,792,798,4030,4056,4057,27934,27935,27936,54770 |
linkProvider | IEEE |
linkToHtml | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELVoGWAC1ALlqx4YCU1iO7ZXSqsiSoXUVmKL4o-gSpBUpOH3YzshZYCBLVGcKDrbuvfOd-8AuE6DACUhF55AIfGsGqGXaEK9SCMpEaIykjbgNpnT2Qu7H1mZnJumFkZr7ZLP9K29dGf5KpelDZUNDJpnPmItsEswpX5VrdVEVKxSCnZ9nAw354ZgYUxriZ3v-6iu0At8PhjO76qQilVndp1V_gaZztmMD_73m4egu63ag8-NPzoCOzrrgOUPIY08g6tsk8Oq5N7ODnxrVJWhtDjaJg5VI21C_CssyrUF6PDTBffhu0u91EUXLMejxXDi1a0UvFUY8o2XWhEehVOFufRTKZWhOUqnPDGAKyXMsIxIMLN_fZ5os0cZSRVDwqfmOYmEJOgYtLM806cABowHWAoqOCIYacP3zLxGNDFESvmC6x7oWJPE60otI66t0QP9b-PGZgXbY4kk03lZxNT4SEwI6oGTyubbVw1Yo5zjs98_2Qd7k8XTNJ4-zB7Pwb4TVXXBkQvQ3nyU-hK0ClVeuVXyBeV2uZU |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2005+IEEE+Computational+Systems+Bioinformatics+Conference+%28CSB%2705%29&rft.atitle=Investigation+into+biomedical+literature+classification+using+support+vector+machines&rft.au=Polavarapu%2C+N.&rft.au=Navathe%2C+S.B.&rft.au=Ramnarayanan%2C+R.&rft.au=ul+Haque%2C+A.&rft.date=2005-01-01&rft.pub=IEEE&rft.isbn=9780769523446&rft.spage=366&rft.epage=374&rft_id=info:doi/10.1109%2FCSB.2005.36&rft.externalDocID=1498038 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1551-7497&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1551-7497&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1551-7497&client=summon |