Yuxin (Audrey) Wang

Contact me via yuxinwangcs@outlook.com

selfie2.jpg

Hello! I am Yuxin, a second-year Computer Science Ph.D. student at Dartmouth, advised by Prof. Soroush Vosoughi, with additional support from Prof. Saeed Hassanpour. I specialize in NLP, focusing on the cognitive and societal aspects of AI. My work primarily explores the cognitive and behavioral performances of AI systems by analyzing their language understanding capabilities and probing their reasoning processes. I am also interested in measuring and improving conversational AI’s integration with external tools (e.g., information retriever, strategy planner) to support more effective AI-human interaction.

I obtained my master’s and bachelor’s degrees in Computer Science from Nanjing Univerisity in 2023 and 2019, respectively. During my master’s time at Websoft Lab, my research encompassed knowledge graphs representation learning and continual learning. I have participated in multiple researches and projects.

Sometimes I write research and tech blogs on Medium and MyBlog. I also love jogging and cycling. Feel free to reach out for any potential research collaboration opportunities. :raised_hands:


News

Jun 30, 2025 I am selected again for the OpenAI’s Researcher Access Program. Thank you!
Jun 15, 2025 Our work :page_facing_up:Backdooring VLMs via Concept-Driven Triggers is accepted to ICML’25 DIG-BUG workshop, where we introduced the first concept‐driven backdoor for instruction‐tuned VLMs!
May 28, 2025 Check out our new preprint :page_facing_up:Topic Association Analysis, which investigates LLMs’ over-sensitivity issue via probing the biases in their associations.
May 14, 2025 Our work :page_facing_up:Visibility as Survival on low-resource language generalization was accepted to ACL findings and will be published soon!
Apr 7, 2025 I will be a Research Intern at Microsoft in Redmond this summer.
Mar 10, 2025 Happy to receive the ICLR 2025 Travel Assistance Fund. See you in Singapore this April!

Internship experience

  • Microsoft Logo Microsoft Turing / MSAI — Research Intern June 2025 – September 2025
    Redmond, WA, USA
    Mentor:
    Nick Craswell
    Project: conversational agent, information retrieval
  • MSRA Logo Microsoft Research Asia — Research Intern July 2022 - February 2023
    Beijing, China
    Mentor:
    Börje Karlsson
    Project: Knowledge-enhanced open-world story generation

Selected publications

  1. Preprint
    Probing Association Biases in LLM Moderation Over-Sensitivity
    Yuxin Wang, Botao Yu, Ivory Yang, Saeed Hassanpour, and Soroush Vosoughi
    2025 Under review
  2. ICLR
    ImpScore: A Learnable Metric For Quantifying The Implicitness Level of Sentences
    Yuxin Wang, Xiaomeng Zhu, Weimin Lyu, Saeed Hassanpour, and Soroush Vosoughi
    In ICLR 2025 (spotlight)
  3. ACL
    MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations
    Yuxin Wang, Ivory Yang, Saeed Hassanpour, and Soroush Vosoughi
    In ACL 2024 (oral presentation)
  4. ISWC
    Facing Changes: Continual Entity Alignment for Growing Knowledge Graphs
    Yuxin Wang, Yuanning Cui, Wenqiang Liu, Zequn Sun, Yiqiao Jiang, Kexin Han, and Wei Hu
    In The Semantic Web – ISWC 2022 (acceptance rate: 17.6%)
  5. Neurocomputing
    Open-world Story Generation with Structured Knowledge Enhancement: A Comprehensive Survey
    Yuxin Wang, Jieru Lin, Zhiwei Yu, Wei Hu, and Borje F. Karlsson
    Neurocomputing 2023