Member
Petr Knoth
Snr Research Fellow in Text and Data Mining
I lead the Big Scientific Data and Text Analytics Group (BSDTAG) doing R&D in the domains of text-mining, digital libraries and open access/science. I am the founder and head of CORE (core.ac.uk), a large full text aggregator of open access papers with millions of monthly active users. CORE makes research papers available for people to freely discover and access and for machines to text-mine.
Previously, I worked as a Senior Data Scientist at Mendeley on information extraction and content recommendation for research. I have a deep interest in the use of AI to improve research workflows. I have co-founded Semantometrics.org which aim to go beyond bibliometrics and altmetrics to produce new research evaluation methods that make use of the publication full-texts in research assessment.
I have been involved as a researcher and as a PI in over 20 European Commission, national and international funded research projects in the areas of text-mining, open science and eLearning.
Keys: Natural Language Processing, Text and data mining Open Access, Open Science, Scholarly communication Information Retrieval, Information Extraction, Recommendation systems, Scientometrics
Team: Valeriy Budko, Matteo Cancellieri, Catherine Kuliavets, Samuel Pearce, Nancy Pontika, Maria Tarasiuk
Projects
Technologies
Frictionless Data Exchange Across Research Data, Software and Scientific Paper Repositories
News
05 Sep 2022
29 Jun 2022
23 May 2022
28 Apr 2022
24 Mar 2022
Publications
Kusa, W., Lipani, A., Knoth, P. and Hanbury, A. (2023) An analysis of work saved over sampling in the evaluation of automated citation screening in systematic literature reviews, Intelligent Systems with Applications, 18, Elsevier
Thelwall, M., Kousha, K., Wilson, P., Makita, M., Abdoli, M., Stuart, E., Levitt, J., Knoth, P. and Cancellieri, M. (2023) Predicting article quality scores with machine learning: The UK Research Excellence Framework, Quantitative Science Studies, pp. (early access), MIT Press
Knoth, P., Herrmannova, D., Cancellieri, M., Anastasiou, L., Pontika, N., Pearce, S., Gyawali, B. and Pride, D. (2023) CORE: A Global Aggregation Service for Open Access Papers, Scientific Data, 10, Nature Publishing Group UK
N. Kunnath, S., Pride, D. and Knoth, P. (2022) Dynamic Context Extraction for Citation Classification, The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, Virtual
Pontika, N., Klebel, T., Correia, A., Metzler, H., Knoth, P. and Ross-Hellauer, T. (2022) Indicators of research quality, quantity, openness and responsibility in institutional review, promotion and tenure policies across seven countries, Quantitative Science Studies, pp. (Early Access), MIT Press