Workshop Schedule (UPDATED)

The workshop will take place both in person (in Lotus Suite 15) and online on August 16th, 2024, at ACL 2024.

The SDP 2024 proceedings are available at this link

Program Time (Local) Details
Intro & Welcome 09:00 - 09:10 Introduction to the 4th Workshop on Scholarly Document Processing (SDP)
Keynote 1 09:10 - 09:50 Speaker: Iryna Gurevych (Technical University of Darmstadt, head of the UKP Lab)
Title: How to InterText? Elevating NLP to the cross-document level
Oral Talks 1 09:50 - 10:35 3 talks:
  • Simulating Expert Discussions with Multi-agent for Enhanced Scientific Problem Solving (Ziyue Li, Yuan Chang, Xiaoqiu Le) (in-person)
  • CoSAEmb: Contrastive Section-aware Aspect Embeddings for Scientific Articles (Shruti Singh, Mayank Singh) (in-person)
  • AffilGood: Building reliable institution name disambiguation tools to improve scientific literature analysis (Nicolau Duran-Silva, Pablo Accuosto, Piotr Przybyła, Horacio Saggion) (in-person)
Break 1 + Networking 10:35 - 11:00
Keynote 2 11:00 - 11:40 Speaker: Doug Downey (Northwestern University and Allen Institute for AI, USA)
Title: Chasing high-precision NLP at discount prices: Lessons for accelerating science
Shared Tasks 11:40 - 12:10 2 overview talks:
  • DAGPap24: Detecting Automatically Generated Scientific Papers
  • Context24: Contextualizing Scientific Figures and Tables
Oral Talks 2 12:10 - 12:40 2 talks:
  • An Analysis of Tasks and Datasets in Peer Reviewing (Moritz Staudinger, Wojciech Kusa, Florina Piroi, Allan Hanbury) (in-person)
  • Controllable Citation Sentence Generation with Language Models (Nianlong Gu, Richard Hahnloser) (remote)
Lunch (+In-person Posters) 12:40 - 14:10 In-person posters:
  • Artificial Intuition: Efficient Classification of Scientific Abstracts (Harsh Sakhrani, Naseela Pervez, Anirudh Ravi Kumar, Fred Morstatter, Alexandra Graddy-Reed, Andrea Belz)
  • Zero-shot Scientific Claim Verification Using LLMs and Citation Text (Maxwell Bennett, Carlos Alvarez, Lucy Lu Wang)
  • First Steps in Building a Knowledge Base of Mathematical Results (Shrey Mishra, Yacine Brihmouche, Théo Delemazure, Antoine Gauquier, Pierre Senellart)
  • CiteAssist: A System for Automated Preprint Citation and BibTeX Generation (Lars Benedikt Kaesberg, Terry Ruas, Jan Philip Wahle, Bela Gipp)
  • Researcher Representations Based on Aggregating Embeddings of Publication Titles: A Case Study in a Japanese Academic Database (Hiroyoshi Nagao, Marie Katsurai)
  • CSIRO-LT at Context24: Contextualising Scientific Figures and Tables in Scientific Literature (Necva Bölücü, Vincent Nguyen, Roelien C. Timmer, Huichen Yang, Maciej Rybinski, Stephen Wan, Sarvnaz Karimi)
  • OSX at Context24: How Well Can GPT Tackle Contexualizing Scientific Figures and Tables (Tosho Hirasawa)
  • Guiding Large Language Models via External Attention Prompting for Scientific Extreme Summarization (Yuan Chang, Ziyue Li, Xiaoqiu Le)
  • MISTI: Metadata-Informed Scientific Text and Image Representation through Contrastive Learning (Pawin Taechoyotin, Daniel Acuna)
  • Synthetic Context with LLM for Entity Linking from Scientific Tables (Yuji Oshima, Hiroyuki Shindo, Hiroki Teranishi, Hiroki Ouchi, Taro Watanabe)
  • Beyond Retrieval: Topic-based Alignment of Scientific Papers to Research Proposal (Rudra Nath Palit, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff)
  • AutoRef: Generating Refinements of Reviews Given Guidelines (Soham Chitnis, Manasi Patwardhan, Ashwin Srinivasan, Tanmay Tulsidas Verlekar, Lovekesh Vig, Gautam Shroff)
  • Toward Related Work Generation with Structure and Novelty Statement (Kazuya Nishimura, Kuniaki Saito, Tosho Hirasawa, Yoshitaka Ushiku)
Remote posters:
  • Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning (Jun Zhuang, Casey Kennington)
  • Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers (Nikita Andreev, Alexander Shirnin, Vladislav Mikhailov, Ekaterina Artemova)
  • Harnessing CLIP for Evidence Identification in Scientific Literature: A Multimodal Approach to Context24 Shared Task (Anukriti Kumar, Lucy Lu Wang)
  • An end-to-end entity recognition and disambiguation framework for identifying Author Affiliation from literature publications (Lianghong Lin, Wenxiu Xie, Zili Chen, Tianyong Hao)
  • Utilizing an Ensemble Model with Anomalous Label Smoothing to Detect Generated Scientific Papers (Yuan Zhao, Junruo Gao, Junlin Wang, Gang Luo, Liang Tang)
  • Cited Text Spans for Scientific Citation Text Generation (Xiangci Li, Yi-Hui Lee, Jessica Ouyang)
  • Multi-head Span-based Detector for AI-generated Fragments in Scientific Papers (German Gritsai, Ildar Khabutdinov, Andrey Grabovoy)
Keynote 3 14:10 - 14:50 Speaker: Heng Ji (University of Illinois at Urbana-Champaign, USA)
Title: AI Plays Medicinal Chemist
Oral Talks 3 14:50 - 15:30 3 talks:
  • Metadata Enhancement Using Large Language Models (Hyunju Song, Steven Bethard, Andrea Thomer) (remote)
  • Integrating Table Representations into Large Language Models for Improved Scholarly Document Comprehension (Buse Sibel Korkmaz, Antonio del Rio Chanona) (in-person)
  • Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning (Sai Munikoti, Anurag Acharya, Sridevi Wagle, Sameera Horawalavithana) (remote)
Break 2 + Networking 15:30 - 16:00
Keynote 4 16:00 - 16:40 Speaker: Anna Rogers (University of Copenhagen)
Title: Large language models as research assistants: workflows and challenges
Panel Discussion 16:40 - 17:20
Closing 17:20 - 17:30


Contact: sdproc2024@googlegroups.com

Sign up for updates: https://groups.google.com/g/sdproc-updates

Follow us: https://twitter.com/SDPWorkshop
