A B C D E F G H I J K L M N O P Q R S T U V W X Y Z all


Petr KnothMember status icon

Snr Research Fellow in Text and Data Mining
Petr Knoth Photograph

Telephone Icon +44 (0)1908 654548

Email Icon Website Icon Camera Icon RDF Icon

LinkedIn Icon SlideShare Icon

I lead the Big Scientific Data and Text Analytics Group (BSDTAG) doing R&D in the domains of text-mining, digital libraries and open access/science. I am the founder and head of CORE (core.ac.uk), a large full text aggregator of open access papers with millions of monthly active users. CORE makes research papers available for people to freely discover and access and for machines to text-mine.

Previously, I worked as a Senior Data Scientist at Mendeley on information extraction and content recommendation for research. I have a deep interest in the use of AI to improve research workflows. I have co-founded Semantometrics.org which aim to go beyond bibliometrics and altmetrics to produce new research evaluation methods that make use of the publication full-texts in research assessment.

I have been involved as a researcher and as a PI in over 20 European Commission, national and international funded research projects in the areas of text-mining, open science and eLearning.

Keys: Natural Language Processing, Text and data mining Open Access, Open Science, Scholarly communication Information Retrieval, Information Extraction, Recommendation systems, Scientometrics

Team: Valeriy Budko, Matteo Cancellieri, Catherine Kuliavets, Samuel Pearce, Nancy Pontika, Maria Tarasiuk


05 Sep 2022

29 Jun 2022

23 May 2022

28 Apr 2022

24 Mar 2022

View all 99 Articles


Publications | Visit External Site for Details Publications | doi 

Kusa, W., Lipani, A., Knoth, P. and Hanbury, A. (2023) An analysis of work saved over sampling in the evaluation of automated citation screening in systematic literature reviews, Intelligent Systems with Applications, 18, Elsevier

Publications | Visit External Site for Details Publications | doi 

Thelwall, M., Kousha, K., Wilson, P., Makita, M., Abdoli, M., Stuart, E., Levitt, J., Knoth, P. and Cancellieri, M. (2023) Predicting article quality scores with machine learning: The UK Research Excellence Framework, Quantitative Science Studies, pp. (early access), MIT Press

Publications | Visit External Site for Details Publications | doi 

Knoth, P., Herrmannova, D., Cancellieri, M., Anastasiou, L., Pontika, N., Pearce, S., Gyawali, B. and Pride, D. (2023) CORE: A Global Aggregation Service for Open Access Papers, Scientific Data, 10, Nature Publishing Group UK

Publications | Visit External Site for Details Publications | Visit External Site for Details  

N. Kunnath, S., Pride, D. and Knoth, P. (2022) Dynamic Context Extraction for Citation Classification, The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, Virtual

Publications | Visit External Site for Details Publications | doi 

Pontika, N., Klebel, T., Correia, A., Metzler, H., Knoth, P. and Ross-Hellauer, T. (2022) Indicators of research quality, quantity, openness and responsibility in institutional review, promotion and tenure policies across seven countries, Quantitative Science Studies, pp. (Early Access), MIT Press

View all 93 publications

View By

Research Themes

Latest Seminar
Dr. Martin Hlosta
Swiss Distance University of Applied Sciences

Learning analytics to provide enhanced feedback for students, teachers and learning designers

Watch the live webcast


Knowledge Media Institute
The Open University
Walton Hall
Milton Keynes
United Kingdom

Tel: +44 (0)1908 653800

Fax: +44 (0)1908 653169

Email: KMi Support


If you have any comments, suggestions or general feedback regarding our website, please email us at the address below.

Email: KMi Development Team