An #AI Tool Recreates Some of the Friction of Academic Search. Is that Actually Helpful?
An #AI Tool Recreates Some of the Friction of Academic Search. Is that Actually Helpful?
How do we know how much we know-- and what?
"Over the past decades, several methods have been proposed to ease the laborious task of searching for relevant papers. Various books and review articles summarize these methodologies, findings, and implications. However, they often fail to provide a detailed retrospective of recent advances, including their evolution, current state, and challenges. This article addresses that gap by reviewing the most relevant and authoritative literature on advances in academic search systems. It proposes a generic layered architecture of scholarly retrieval systems and provides a detailed analysis of ranking methodologies, datasets, and evaluation methods. Additionally, it identifies critical open research challenges and issues, offering a promising foundation for future research and development in this vital field."
#AcademicSearch
#ScholarlyRetrieval
#LIbraryScience
https://link.springer.com/article/10.1007/s13278-025-01476-1
Most countries today emphasize scientific research and invest heavily in it. As a result, the number of scholarly documents has increased dramatically. Both academics and industry professionals seek to retrieve relevant papers efficiently. However, identifying relevant documents on a given topic using current academic search systems is challenging. The reasons include the exponential growth of research publications, ambiguity and limitations in searchers’ keywords, and the complexity of citation networks. Over the past decades, several methods have been proposed to ease the laborious task of searching for relevant papers. Various books and review articles summarize these methodologies, findings, and implications. However, they often fail to provide a detailed retrospective of recent advances, including their evolution, current state, and challenges. This article addresses that gap by reviewing the most relevant and authoritative literature on advances in academic search systems. It proposes a generic layered architecture of scholarly retrieval systems and provides a detailed analysis of ranking methodologies, datasets, and evaluation methods. Additionally, it identifies critical open research challenges and issues, offering a promising foundation for future research and development in this vital field.
Full details of our new subject classifier can be found in Christoph Broschinski's Master's thesis: https://nbn-resolving.org/urn:nbn:de:hbz:79pbc-opus-25138
In der wissenschaftliche Suchmaschine BASE werden bereits seit Jahren Dokumente maschinell nach der Dewey Decimal Classification (DDC) erschlossen. Die vorliegende Arbeit beschreibt die Erstellung eines Systems des maschinellen Lernens mit dem Ziel, das mittlerweile veraltete Klassifikationssystem in BASE zu ersetzen. Zu diesem Zweck ist es erforderlich, Daten aus BASE zu gewinnen, die als Trainingsmenge eines maschinellen Lernverfahrens dienen können. Es wird gezeigt, wie mithilfe einer explorativen Analyse aus einem Korpus von über 220 Mio. Dokumenten geeignete Daten extrahiert, kuratiert und zu sprachspezifischen Lernkorpora umgearbeitet werden können, die hierzu entwickelte Software ist ein integraler Bestandteil dieser Arbeit. Auf dieser Grundlage werden mithilfe des Toolkits Annif eine Reihe von Klassifikatoren erstellt, deren Leistungsfähigkeit anschließend evaluiert und ein geeigneter Kandidat ausgewählt wird. Ein finaler Vergleich zeigt, dass das in dieser Ausarbeitung erstellte System dem zur Zeit im Einsatz befindlichen BASE-Klassifikator weit überlegen ist. Abschließende Betrachtungen zeigen allerdings auch verschiedene Schwächen des Ansatzes auf, die zugleich einen Bogen zu allgemeinen Erwägungen im Rahmen des derzeitigen „Frühlings“ der künstlichen Intelligenz schlagen.
#BASE wird heute 20 Jahre alt!
Die damalige Ankündigung hat im Prinzip nichts von ihrer Aktualität verloren: https://www.inetbib.de/listenarchiv/msg24765.html
Inzwischen findet man in BASE Nachweise für fast 400 Mio. akademische Dokumente aus knapp 11.500 Quellen und viele Links zu Volltexten: https://base-search.net
This toot in English: https://openbiblio.social/@base/112670487737084794
#BASEsearch #academicSearch #searchEngine #scholComm #Suchmaschine #akademischeSuche #Suche #BASEsuche #Wissenschaftskommunikation
#BASE indexes 340+ million documents from more than 11.111 sources (publication repositories, publishers etc.).
https://www.base-search.net/
Want to try a different search experience? Go to one of our partners:
– @MetaGer's science tab gets its results from us: https://metager.org/?focus=science
– #OpenKnowledgeMaps also uses #BASEsearch by default, and clusters results to give you an overview of your subject: https://openknowledgemaps.org/
#MetaGer #academicSearch #scholarlySearch #publicationSearch #searchEngine
Fancy working with me? The Faculty of Science and Engineering at the University of Wolverhampton offers, among others, a PhD studentship on Large Language Models for Academic Search. Please get in touch with me if you’re interested. Look for the LASER project at https://www.wlv.ac.uk/schools-and-institutes/faculty-of-science-and-engineering/research/phd-studentships/
#phd #ai #informationRetrieval #studentship #largeLanguageModels #chatgpt #chatgpt4 #bard #nlp #computationalLinguistics #search #academicSearch #recommendersystems #university #research