Publications

2025

  1. NeurIPS
    Transformer Key-Value Memories Are Nearly as Interpretable as Sparse Autoencoders
    Mengyu Ye, Jun Suzuki, Tatsuro Inaba, and Tatsuki Kuribayashi
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, Dec 2025
  2. arXiv
    Camellia: Benchmarking Cultural Biases in LLMs for Asian Languages
    Tarek Naous, Anagha Savit, Carlos Rafael Catalan, Geyang Guo, Jaehyeok Lee, Kyungdon Lee, Lheane Marie Dizon, Mengyu Ye, and 12 more authors
    Dec 2025
  3. ACL Findings
    Can Input Attributions Explain Inductive Reasoning in In-Context Learning?
    Mengyu Ye, Tatsuki Kuribayashi, Goro Kobayashi, and Jun Suzuki
    In Findings of the Association for Computational Linguistics: ACL 2025, Jul 2025

2023

  1. EMNLP
    Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism
    Mengyu Ye, Tatsuki Kuribayashi, Jun Suzuki, Goro Kobayashi, and Hiroaki Funayama
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (oral), Dec 2023
  2. SemEval
    TohokuNLP at SemEval-2023 Task 5: Clickbait Spoiling via Simple Seq2Seq Generation and Ensembling
    Hiroto Kurita, Ikumi Ito, Hiroaki Funayama, Shota Sasaki, Shoji Moriya, Mengyu Ye, Kazuma Kokuta, Ryujin Hatakeyama, and 2 more authors
    In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), Jul 2023

Projects

2025

  1. CLI Tool LLM Agent System
    BibTeX Cleaning Agent
    An LLM-based agentic system that automatically cleans and standardizes BibTeX entries. It queries DBLP, Semantic Scholar, and arXiv to fix formatting, normalize fields, enrich metadata, and update each entry with the paper’s official publication and venue information. The tool can generate consistent citation keys, detect and remove duplicates, and produce a JSON file that maps original keys to their cleaned versions. It also reports entries that are missing required fields so that users can review and correct them manually.
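
    The key-generation, de-duplication, key-mapping, and missing-field steps described above can be illustrated with a minimal Python sketch. This is not the tool's actual implementation: the function names, the `surname + year + first title word` key scheme, the title-based duplicate check, and the `REQUIRED_FIELDS` table are all assumptions made for illustration, and the LLM agent and the DBLP, Semantic Scholar, and arXiv lookups are left out entirely.

```python
import json
import re

# Hypothetical required fields per entry type (an assumption, not the tool's rules).
REQUIRED_FIELDS = {
    "article": ("author", "title", "journal", "year"),
    "inproceedings": ("author", "title", "booktitle", "year"),
}


def make_key(entry):
    """Build a key such as 'ye2025transformer' (assumed scheme)."""
    first_author = (entry.get("author") or "anon").split(" and ")[0].strip()
    if "," in first_author:
        surname = first_author.split(",")[0]      # "Ye, Mengyu" -> "Ye"
    else:
        surname = first_author.split()[-1]        # "Mengyu Ye" -> "Ye"
    surname = re.sub(r"[^a-z]", "", surname.lower())
    words = re.findall(r"[a-z0-9]+", entry.get("title", "").lower())
    first_word = words[0] if words else "untitled"
    return f"{surname}{entry.get('year', '')}{first_word}"


def title_fingerprint(entry):
    """Lowercased alphanumeric tokens of the title, used to spot duplicates."""
    return " ".join(re.findall(r"[a-z0-9]+", entry.get("title", "").lower()))


def clean_entries(entries):
    """Return (cleaned entries by new key, old-to-new key map, missing-field report)."""
    cleaned, key_map, missing = {}, {}, {}
    seen = {}  # title fingerprint -> new key of the first entry with that title
    for entry in entries:
        fingerprint = title_fingerprint(entry)
        if fingerprint and fingerprint in seen:
            # Duplicate: drop it, but keep its old key pointing at the kept entry.
            key_map[entry["key"]] = seen[fingerprint]
            continue
        base = new_key = make_key(entry)
        suffix = 1
        while new_key in cleaned:                 # different papers, same key
            suffix += 1
            new_key = f"{base}-{suffix}"
        cleaned[new_key] = {**entry, "key": new_key}
        key_map[entry["key"]] = new_key
        seen[fingerprint] = new_key
        required = REQUIRED_FIELDS.get(entry.get("type", ""), ())
        lacking = [field for field in required if not entry.get(field)]
        if lacking:
            missing[new_key] = lacking
    return cleaned, key_map, missing


sample = [
    {"key": "Ye2025", "type": "inproceedings",
     "author": "Mengyu Ye and Jun Suzuki", "year": "2025", "booktitle": "NeurIPS",
     "title": "Transformer Key-Value Memories Are Nearly as Interpretable as Sparse Autoencoders"},
    {"key": "ye_kv_dup", "type": "inproceedings",
     "author": "Ye, Mengyu", "year": "2025",
     "title": "transformer key-value memories are nearly as interpretable as sparse autoencoders."},
    {"key": "naous", "type": "article",
     "author": "Tarek Naous and Anagha Savit", "year": "2025",
     "title": "Camellia: Benchmarking Cultural Biases in LLMs for Asian Languages"},
]

cleaned, key_map, missing = clean_entries(sample)
print(json.dumps(key_map, indent=2))   # the old-key to new-key mapping
print(json.dumps(missing, indent=2))   # entries that need manual review
```

    In this sketch the second sample entry is treated as a duplicate of the first because their titles match after lowercasing and stripping punctuation, and the third entry is reported because the assumed `article` rules require a `journal` field. A real cleaner would likely also compare DOIs or arXiv identifiers before merging entries.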