Workshop Schedule

The workshop will take place both in-person (Vienna, Austria) and online (Underline) on July 31st (Thursday) at ACL 2025.

The SDP 2025 proceedings are now available at https://aclanthology.org/events/sdp-2025/

All long papers are allotted 10 minutes of main presentation followed by 5 minutes for Q&A. All short papers are allotted 7 minutes of main presentation followed by 5 minutes for Q&A. Shared task papers will have poster presentations only (in-person and virtual). The virtual poster session will take place in this separate Zoom session [link], where presenters will be in breakout rooms, sharing their screens.
Program Time (Local Vienna time CEST) Details
Intro & Welcome 09:00 - 09:10 Introduction to the 5th Workshop on Scholarly Document Processing (SDP) by Philipp Mayr
Keynote 1 09:10 - 09:50 Speaker: Lucy Lu Wang (University of Washington and Allen Institute for Artificial Intelligence, USA)
Title: From Paper to Practice: Rethinking Science Communication in the Age of LLMs
Oral Presentations 1 09:50 - 10:30 Three talks:
  • Analyzing the Evolution of Scientific Misconduct based on the Language of Retracted Papers by Christof Bless, Andreas Waldis, Angelina Parfenova, Maria A. Rodriguez, Andreas Marfurt (long)
  • Literature Discovery with Natural Language Queries by Anna Kiepura, Jessica Lam, Nianlong Gu, Richard Hahnloser (short)
  • TeXpert: A Multi-Level Benchmark for Evaluating Latex Code Generation by LLMs by Sahil Kale, Vijaykant Nadadur (short)
Break 1 + Networking 10:30 - 11:00
Keynote 2 11:00 - 11:40 Speaker: Mario Krenn (University of Tübingen, Germany)
Title: Towards an Artificial Muse for New Ideas in Science [online] [Slides]
Shared Tasks at SDP 2025 11:40 - 12:40 SDP 2025 Shared Task Overviews (4)
  • Overview of the SciHal25 Shared Task on Hallucination Detection for Scientific Content by Dan Li, Bogdan Palfi, Colin Zhang, Jaiganesh Subramanian, Adrian Raudaschl, Yoshiko Kakita, Anita De Waard, Zubair Afzal, Georgios Tsatsaronis
  • SciVQA 2025: Overview of the First Scientific Visual Question Answering Shared Task by Ekaterina Borisova, Nikolas Rauscher, Georg Rehm
  • The ClimateCheck Shared Task: Scientific Fact-Checking of Social Media Claims about Climate Change by Raia Abu Ahmad, Aida Usmanova, Georg Rehm
  • SOMD2025: A Challenging Shared Tasks for Software Related Information Extraction by Sharmila Upadhyaya, Wolfgang Otto, Frank Krüger, Stefan Dietze
Lunch (+In-person Posters) 12:40 - 14:00 In-person Posters: Shared Tasks Only
Keynote 3 14:00 - 14:40 Speaker: Eduard Hovy (University of Melbourne, Australia and Carnegie Mellon University, USA)
Title: Scientific Authorship in the Age of LLMs
Oral Presentations 2 14:40 - 15:30 Four talks:
  • Collage: Decomposable Rapid Prototyping for Co-Designed Information Extraction on Scientific PDFs by Sireesh Gururaja, Yueheng Zhang, Guannan Tang, Tianhao Zhang, Kevin Murphy, Yu-Tsen Yi, Junwon Seo, Anthony Rollett, Emma Strubell (long)
  • Data Gatherer: LLM-Powered Dataset Reference Extraction from Scientific Literature by Pietro Marini, Aécio Santos, Nicole Contaxis, Juliana Freire (short)
  • LGAR: Zero-Shot LLM-Guided Neural Ranking for Abstract Screening in Systematic Literature Reviews by Christian Jaumann, Andreas Wiedholz, Annemarie Friedrich (long) [Findings]
  • Predicting Scholarly Impact with Retrieval-Augmented LLMs by Tamjid Azad, Ibrahim Al Azher, Sagnik Ray Choudhury, Hamed Alhoori (short) [online]
Break 2 + Networking 15:30 - 16:00
Oral Presentations 3 16:00 - 17:30 Six talks:
  • MathD2: Towards Disambiguation of Mathematical Terms by Shufan Jiang, Mary Ann Tan, Harald Sack (long)
  • Literature-Grounded Novelty Assessment of Scientific Ideas by Simra Shahid, Marissa Radensky, Raymond Fok, Pao Siangliulue, Daniel S Weld, Tom Hope (long) [online]
  • GraphTranslate: Predicting Clinical Trial Translation using Graph Neural Networks on Biomedical Literature by Emily Muller, Justin Boylan-Toomey, Jack Ekinsmyth, Arne Robben, María de la Paz Cardona, Antonia Langfelder (short)
  • Document Attribution: Examining Citation Relationships using Large Language Models by Vipula Rawte, Ryan A. Rossi, Franck Dernoncourt, Nedim Lipka (short) [online]
  • The ClimateCheck Dataset: Mapping Social Media Claims About Climate Change to Corresponding Scholarly Articles by Raia Abu Ahmad, Aida Usmanova, Georg Rehm (long)
  • FineCite: A Novel Approach For Fine-Grained Citation Context Analysis by Lasse M. Jantsch, Dong-Jae Koh, Seonghwan Yoon, Jisu Lee, Anne Lauscher, Young-Kyoon Suh (long) [Findings]
Remote/Virtual Posters 17:30 - 17:55 Shared task posters only. Join via the separate Zoom link; presenters will be in breakout rooms.
Closing 17:55 - 18:00


Contact: sdproc2024@googlegroups.com

Sign up for updates: https://groups.google.com/g/sdproc-updates

Follow us: https://twitter.com/SDPWorkshop
