About the workshop

Highlight

  • Submission site is now available. All deadlines below are at 23:59 UTC-12 (Anywhere on Earth).
  • The paper submission deadline (main workshop) is extended to March 15, 2025.

You are invited to participate in the 5th Workshop on Scholarly Document Processing (SDP 2025) to be held at ACL 2025 in Vienna, Austria. SDP 2025 will consist of a research track and five shared tasks. The call for research papers is described below.

Papers must follow the ACL format and conform to the ACL 2025 Submission Guidelines. Paper submission has to be done through openreview: 5th SDP at ACL 2025 Workshop.

Introduction

Scholarly literature is the chief means by which scientists and academics document and communicate their results and is therefore critical to the advancement of knowledge and improvement of human well-being. At the same time, this literature poses challenges to NLP uncommon in other genres, such as specialized language and high background knowledge requirements, long documents and strong structural conventions, multimodal presentation, citation relationships among documents, an emphasis on rational argumentation, and the frequent availability of detailed metadata and experimental data. These challenges necessitate the development of NLP methods and resources optimized for this domain. The Scholarly Document Processing (SDP) workshop provides a venue for discussing these challenges, bringing together stakeholders from different communities including computational linguistics, machine learning, text mining, information retrieval, digital libraries, scientometrics and others, to develop methods, tasks, and resources in support of these goals.

This workshop builds on the success of prior workshops: the 1st, 2nd, 3rd, and 4th SDP workshops held at EMNLP 2020, NAACL 2021, COLING 2022, and ACL 2024, and the 1st and 2nd SciNLP workshops held at AKBC 2020 and 2021. In addition to having broad appeal within the NLP community, we hope the SDP workshop will attract researchers from other relevant fields including meta-science, scientometrics, data mining, information retrieval, and digital libraries, bringing together these disparate communities within ACL.

Topics of interest

We invite submissions from all communities demonstrating usage of and challenges associated with natural language processing, information retrieval, and data mining of scholarly and scientific documents. Relevant tasks include (but are not limited to):

  • Large Language Models (LLMs) for Science
  • Representation learning and language modeling
  • Information extraction and NER
  • Document understanding
  • Summarization and generation
  • Question-answering
  • Discourse modeling/argumentation mining
  • Network analysis
  • Bibliometrics, scientometrics, and altmetrics
  • Reproducibility and research integrity, including new challenges posed by generative AI
  • Peer review tools, principles and technology
  • Metadata and indexing
  • Inclusion of datasets and computational resources
  • Research infrastructures and digital libraries
  • Increasing the representation in scholarly work of disadvantaged populations
  • LLM-based interfaces to consume/produce scholarly documents
  • Impact of scholarly communication on popular discourse

Inclusivity

We encourage submissions from diverse voices. Anyone who self-identifies with an underrepresented demographic, regardless of seniority, scientific background, training, etc., is welcome to submit their work and attend the workshop. We aim to create an inclusive space and look forward to receiving submissions from researchers of all identities, living anywhere, whether first time authors or more experienced writers.

Submission Information

Authors are invited to submit full and short papers with unpublished, original work. Submissions will be subject to a double-blind peer review process. Accepted papers will be presented by the authors at the workshop either as a talk or a poster. All accepted papers will be published in the workshop proceedings. All accepted papers will be published in the workshop proceedings (proceedings from previous years can be found in the ), which will be published in the ACL Anthology.

The submissions must be in PDF format and anonymized for review. All submissions must be written in English and follow the ACL 2025 formatting requirements:

Long paper submissions: up to 8 pages of content, plus unlimited references.

Short paper submissions: up to 4 pages of content, plus unlimited references.

Submission Website: Submission is electronic, using the OpenReview conference management system.

Final versions of accepted papers will be allowed 1 additional page of content so that reviewer comments can be taken into account.

To receive updates, please join our mailing list or follow us on Twitter.

Important Dates

The dates are at this stage indicative only and can change. The current dates are up-to-date as of Dec 18th, 2025.

Event Date
Paper Submission Deadline March 1, 2025 March 15, 2025 (23:59 UTC-12)
Pre-reviewed (ARR) Submission Deadline March 25, 2025 (23:59 UTC-12)
Notification of acceptance May 10, 2025 (23:59 UTC-12)
Camera-ready submissions due May 30, 2025 (23:59 UTC-12)
Workshop Dates July 31, 2025

Keynote Speakers

We are excited to have several keynote speakers at SDP 2025. The following speakers have been confirmed.

  • Tom Hope, Assistant Professor at Hebrew University of Jerusalem and Research Scientist at Allen Institute for AI.
  • James A. Evans, Professor and Director of the Knowledge Lab at University of Chicago and External Professor at the Santa Fe Institute.
  • TBA

Shared Tasks

SDP 2025 will host five exciting shared tasks. More information about all shared tasks is provided on the shared tasks page. Each shared task will follow-up with a separate CfP.

The shared task track includes the following tasks:

  1. Contextualizing Scientific Figures and Tables (Context25)
  2. SciVQA: Scientific Visual Question Answering
  3. ClimateCheck: Scientific Fact-checking of Social Media Posts on Climate Change
  4. SOMD: Software Mention Detection in Scholarly Publications

For detailed information about the tasks, data, evaluation, and organizers, please see the shared tasks page.

Navigation

  • Introduction
  • Topics of Interest
  • Inclusivity
  • Submission Information
  • Important Dates
  • Keynote Speakers
  • Shared Tasks

Sign up for updates

https://groups.google.com/g/sdproc-updates

Contact

Please contact sdproc2025@googlegroups.com with any questions regarding the workshop.

Follow us

https://twitter.com/SDPWorkshop



Contact: sdproc2025@googlegroups.com

Sign up for updates: https://groups.google.com/g/sdproc-updates

Follow us: https://twitter.com/SDPWorkshop

Back to top