Detecting text level intellectual influence with knowledge graph embeddings

Introduction: Tracing the spread of ideas and the presence of influence is a question of special importance across a wide range of disciplines, ranging from intellectual history to cultural analytics, computational social science, and the science of science. Method: We collect a corpus of open sourc...

Full description

Saved in:
Bibliographic Details
Main Authors: Li, Lucian, Silva, Eryclis
Format: Journal Article
Language:English
Published: 31-10-2024
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Introduction: Tracing the spread of ideas and the presence of influence is a question of special importance across a wide range of disciplines, ranging from intellectual history to cultural analytics, computational social science, and the science of science. Method: We collect a corpus of open source journal articles, generate Knowledge Graph representations using the Gemini LLM, and attempt to predict the existence of citations between sampled pairs of articles using previously published methods and a novel Graph Neural Network based embedding model. Results: We demonstrate that our knowledge graph embedding method is superior at distinguishing pairs of articles with and without citation. Once trained, it runs efficiently and can be fine-tuned on specific corpora to suit individual researcher needs. Conclusion(s): This experiment demonstrates that the relationships encoded in a knowledge graph, especially the types of concepts brought together by specific relations can encode information capable of revealing intellectual influence. This suggests that further work in analyzing document level knowledge graphs to understand latent structures could provide valuable insights.
AbstractList Introduction: Tracing the spread of ideas and the presence of influence is a question of special importance across a wide range of disciplines, ranging from intellectual history to cultural analytics, computational social science, and the science of science. Method: We collect a corpus of open source journal articles, generate Knowledge Graph representations using the Gemini LLM, and attempt to predict the existence of citations between sampled pairs of articles using previously published methods and a novel Graph Neural Network based embedding model. Results: We demonstrate that our knowledge graph embedding method is superior at distinguishing pairs of articles with and without citation. Once trained, it runs efficiently and can be fine-tuned on specific corpora to suit individual researcher needs. Conclusion(s): This experiment demonstrates that the relationships encoded in a knowledge graph, especially the types of concepts brought together by specific relations can encode information capable of revealing intellectual influence. This suggests that further work in analyzing document level knowledge graphs to understand latent structures could provide valuable insights.
Author Li, Lucian
Silva, Eryclis
Author_xml – sequence: 1
  givenname: Lucian
  surname: Li
  fullname: Li, Lucian
– sequence: 2
  givenname: Eryclis
  surname: Silva
  fullname: Silva, Eryclis
BackLink https://doi.org/10.48550/arXiv.2410.24021$$DView paper in arXiv
BookMark eNqFjbsOgkAURLfQwtcHWHl_QASExN5HTGztyQoDbLwsZFke_r1A7K0mczKZsxQzXWoIsfVcJziFoXuQplet4wcD8APX9xbicYFFbJXOyKK3xGjBpLQF88AbOZaUG-gY1Cmb01uXHSPJQJmRVU4oXkiS4aBei3kqucbmlyuxu12f5_t-0kaVUYU0n2jUR5P--H_xBWQnPUA
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2410.24021
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2410_24021
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2410_240213
IEDL.DBID GOX
IngestDate Sat Nov 02 12:35:36 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2410_240213
OpenAccessLink https://arxiv.org/abs/2410.24021
ParticipantIDs arxiv_primary_2410_24021
PublicationCentury 2000
PublicationDate 2024-10-31
PublicationDateYYYYMMDD 2024-10-31
PublicationDate_xml – month: 10
  year: 2024
  text: 2024-10-31
  day: 31
PublicationDecade 2020
PublicationYear 2024
Score 3.8806996
SecondaryResourceType preprint
Snippet Introduction: Tracing the spread of ideas and the presence of influence is a question of special importance across a wide range of disciplines, ranging from...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computation and Language
Title Detecting text level intellectual influence with knowledge graph embeddings
URI https://arxiv.org/abs/2410.24021
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQSUwxN00yTQStqTIw0TVJM7DUBbZaE3UtjA1TLJITjY0TwQfPewSb-0VYuLiCjslRgO2FSSyqyCyDnA-cVKwPrF4M9EDj_8D-DbOREWjJlrt_BGRyEnwUF1Q9Qh2wjQkWQqok3AQZ-KGtOwVHSHQIMTCl5okweLukgkbqgXWEAmiZhUIOaJ2OQibS_g0gB3pViAJoXFQBPtClAD5PWiE1Nyk1BTxJJMog7-Ya4uyhC7Y-vgByVkQ8yGXxYJcZizGwAHv0qRIMCsBmfFKakXkisG2QZmJmnJxoaWIObLklpZmlGaYAOzCSDBK4TJHCLSXNwGUErHEhBasMA0tJUWmqLANzcUqpHDjYAIdIcB0
link.rule.ids 228,230,782,887
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Detecting+text+level+intellectual+influence+with+knowledge+graph+embeddings&rft.au=Li%2C+Lucian&rft.au=Silva%2C+Eryclis&rft.date=2024-10-31&rft_id=info:doi/10.48550%2Farxiv.2410.24021&rft.externalDocID=2410_24021