Petr KnothSnr Research Fellow in Text and Data Mining
I lead the Big Scientific Data and Text Analytics Group (BSDTAG) doing R&D in the domains of text-mining, digital libraries and open access/science. I am the founder and head of CORE (core.ac.uk), a large full text aggregator of open access papers with millions of monthly active users. CORE makes research papers available for people to freely discover and access and for machines to text-mine.
Previously, I worked as a Senior Data Scientist at Mendeley on information extraction and content recommendation for research. I have a deep interest in the use of AI to improve research workflows. I have co-founded Semantometrics.org which aim to go beyond bibliometrics and altmetrics to produce new research evaluation methods that make use of the publication full-texts in research assessment.
I have been involved as a researcher and as a PI in over 20 European Commission, national and international funded research projects in the areas of text-mining, open science and eLearning.
Keys: Natural Language Processing, Text and data mining Open Access, Open Science, Scholarly communication Information Retrieval, Information Extraction, Recommendation systems, Scientometrics
Team: Lucas Anastasiou, Valerii Budko, Matteo Cancellieri, Bikash Gyawali, Kateryna Kuliavets, Sergei Misak, Samuel Pearce, Nancy Pontika, David Pride, Svetlana Rumyanceva, Maria Tarasiuk, Viktor Yakubiv
13 Jul 2020
11 Jun 2020
27 May 2020
22 May 2020
20 May 2020
Gyawali, B., Pontika, N. and Knoth, P. (2020) Open Access 2007 - 2017: Country and University Level Perspective, Joint Conference on Digital Libraries, Virtual Event, China
Pride, D. and Knoth, P. (2020) An Authoritative Approach to Citation Classification, ACM/IEEE Joint Conference on Digital Libraries in 2020 (JCDL '20), Virtual - China
Gyawali, B., Anastasiou, L. and Knoth, P. (2020) Deduplication of Scholarly Documents using Locality Sensitive Hashing and Word Embeddings, 12th Language Resources and Evaluation Conference, Marseille, France
Knoth, P., Anastasiou, L., Cancellieri, M., Gyawali, B., Herrmannova, D., Misak, S., Huba, A., Pearce, S., Pontika, N., Rumyanceva, S. and Tarasiuk, M. (2019) Aggregating The World's Open Access Research Papers
Herrmannova, D., Pontika, N. and Knoth, P. (2019) Do Authors Deposit on Time? Tracking Open Access Policy Compliance, 2019 ACM/IEEE Joint Conference on Digital Libraries, Urbana-Champaign, IL