Shaden Shaar
PhD Student · Cornell University · Computer Science
sshaar31@gmail.com · Ithaca, NY
RESEARCH INTERESTS
Natural Language Processing for complex textual analysis, specializing in long-form text generation and evaluation. My current work explores narrative summarization and document-level question answering, with an emphasis on maintaining semantic coherence across extended passages. Previously, I developed automated fact-checking systems and propaganda detection mechanisms for documents of varying length, contributing to information integrity in digital content.
EDUCATION
- 2021 – present · PhD, Computer Science · Cornell University · Ithaca, NY · Minor: Applied Mathematics
- 2021 – 2024 · M.S., Computer Science · Cornell University · Ithaca, NY
- 2015 – 2019 · B.S., Computer Science · Carnegie Mellon University · Doha, Qatar · Minor: Mathematics · University Honors
EXPERIENCE
- May 2025 – Aug 2025 · Applied Scientist Intern · Zillow Group · Remote
- Jan 2025 – May 2025 · Machine Learning Research Engineer Intern · Scale AI · Remote
- May 2022 – Aug 2022 · AI/ML Research Intern · Apple · Cupertino, CA
- Jul 2019 – Aug 2021 · Research Assistant · Qatar Computing Research Institute, HBKU · Doha, Qatar
- May 2018 – Aug 2018 · Research Intern · Robotics Institute, Carnegie Mellon University · Pittsburgh, PA
- May 2017 – Jun 2018 · Part-Time Research Assistant · Carnegie Mellon University · Doha, Qatar
TEACHING
Cornell University
- Fall 2022 · CS 4740: Natural Language Processing
- Spring 2022 · Introduction to Machine Learning
Carnegie Mellon University
- Fall 2018 & Spring 2019 · 11-785: Introduction to Deep Learning
- Spring 2018 & Spring 2019 · 15-251: Great Theoretical Ideas in Computer Science
- Fall 2017 · 15-213: Introduction to Computer Systems
- Fall 2016 & Fall 2017 · 15-112: Fundamentals of Programming
- Spring 2016 · 21-127: Concepts of Mathematics
SELECTED PUBLICATIONS
- Thematic Analysis of Accepted Exception Requests for Heart Transplant Candidates Using a Large Language Model · JHLT · 2026
- MovieRecapsQA: A Multimodal Open-Ended Video Question-Answering Benchmark · CVPR · 2026
- Are Triggers Needed for Document-Level Event Extraction? · TACL · 2025 · 2 citations
- Pungene at DialAM-2024: Identification of Propositional and Illocutionary Relations · ArgMining · 2024 · 2 citations
- Edward Said at Touché: Human Value Detection Using Transformers and Upsampling · CLEF · 2024 · 1 citation
- A Survey on Multimodal Disinformation Detection · COLING · 2022 · 224 citations
- Overview of the CLEF–2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection · CLEF · 2022 · 90 citations
- The CLEF-2022 CheckThat! Lab on Fighting the COVID-19 Infodemic and Fake News Detection · ECIR · 2022 · 88 citations
- Overview of the CLEF-2022 CheckThat! Lab Task 1 on Identifying Relevant Claims in Tweets · CEUR · 2022 · 83 citations
- Assisting the Human Fact-Checkers: Detecting All Previously Fact-Checked Claims in a Document · EMNLP · 2022 · 45 citations
AWARDS & HONORS
- 2021 · University Fellowship, Cornell University, for exceptional preparation and promise.
- 2020 · Best Demo Award, Honorable Mention, ACL 2020, for the Prta propaganda-analysis system.
- 2019 · University Honors, Carnegie Mellon University, for outstanding GPA.
- 2017 · Dean's List, Carnegie Mellon University: F15, S16, F16, F17.
- 2015 · 50% Academic Merit Scholarship, Carnegie Mellon University.
TECHNICAL SKILLS
- Programming: Python · C · SML · Assembly
- Deep Learning: HuggingFace · PyTorch · TensorFlow · Keras
- Machine Learning: scikit-learn