Nils Feldhus @nfelnlp

Post-doctoral Researcher at BIFOLD / TU Berlin interested in interpretability and analysis of language models. Guest researcher at DFKI Berlin.

nfelnlp.github.io
Berlin, Germany
Joined June 2017

Tweets 90
Followers 242
Following 385
Likes 881

Laura Kopf @lkopf_ml

7 days ago

Happy to share that our PRISM paper has been accepted at #NeurIPS2025 🎉 In this work, we introduce a multi-concept feature description framework that can identify and score polysemantic features. 📄 Paper: arxiv.org/abs/2506.15538 #NeurIPS #MechInterp #XAI

1 5 9 648 0

Laura Kopf @lkopf_ml

3 months ago

🔍 When do neurons encode multiple concepts? We introduce PRISM, a framework for extracting multi-concept feature descriptions to better understand polysemanticity. 📄 Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework arxiv.org/abs/2506.15538 🧵

1 4 13 2K 6

NAACL HLT 2027 @naaclmeeting

11 months ago

📢 NAACL needs Reviewers & Area Chairs! 📝 If you haven't received an invite for ARR Oct 2024 & want to contribute, sign up by Oct 22nd! ➡️AC form: forms.office.com/r/8j6jXLfASt ➡️Reviewer form: forms.office.com/r/cjPNtL9gPE Please RT 🔁 and help spread the word! 🗣️ #NLProc @ReviewAcl

1 26 42 10K 18

Nils Feldhus @nfelnlp

a year ago

Presenting my poster at @inlgmeeting today on political bias evaluation assessing sycophancy in (German-language) LLMs: ACL Anthology: aclanthology.org/2024.inlg-main… This paper resulted from the great Bachelor thesis of Maximilian Bleick co-supervised with @albu and Sebastian Möller.

0 0 2 200 1

BlackboxNLP @BlackboxNLP

a year ago

The submission deadline (15 aug) for BlackboxNLP is slowly approaching! We're very excited to see your approaches to open up the black box 🤩 The submission portal has now been opened on OpenReview: openreview.net/group?id=EMNLP…

0 7 14 3K 4

ACLRollingReview @ReviewAcl

a year ago

If you haven't been invited to review for ARR 2024 June but are interested in helping us, please fill out this form by June 19: forms.office.com/pages/response…

3 36 40 18K 25

Martin Courtois @MCourtois173

a year ago

Excited to share our paper, "Symmetric Dot-Product Attention for Efficient Training of BERT Language Models," accepted at #ACL2024 Findings. This is joint work with Malte Ostendorff, Leonhard Hennig, and Georg Rehm. arXiv: arxiv.org/abs/2406.06366 Github: github.com/mcrts/ACL2024-…

1 2 8 922 3

Inseq @InseqLib

a year ago

@InseqLib v0.6 is out now on PyPI! 🔥 New CLI command for context attribution (@gsarti_), new perturbation-based methods by @hmohebbi75 & @casszzx and optimizations incl. multi-gpu support! ⚡️ Huge shoutout to our contributors! ❤️ Release notes ⬇️ github.com/inseq-team/ins…

0 2 11 6K 1

BIFOLD @bifoldberlin

a year ago

New open #phd position: Contribute to the "FakeXplain - Development of transparent and meaningful explanations in the disinformation detection context" project. Research Assistant - salary grade E 13 TV-L Berliner Hochschulen jobs.tu-berlin.de/en/job-posting…

0 5 4 2K 1

Nils Feldhus @nfelnlp

2 years ago

Thanks a lot to all emergency reviewers who helped fill in the gaps for the #ARR February 2024 cycle! 🫶 We're good to go for the author response period.

1 1 10 3K 0

0 0 0 382 0

Abhilasha Ravichander @lasha_nlp

2 years ago

Looking for potential emergency reviewers for submissions in Interpretability and Model Analysis/NLP Applications! Topics include: LLM Hallucination, Alignment, Privacy. Please reach out if you have the bandwidth to help!🙏 #NLProc #ACL2024

4 11 26 8K 5

Inseq @InseqLib

2 years ago

Value Zeroing, a faithful approach for analyzing context mixing in Transformers, is now available on @InseqLib main branch for all @huggingface text generation models! 🔀 🔍Paper introducing VZ: aclanthology.org/2023.eacl-main… 🐛VZ in Inseq: tinyurl.com/inseq-vz

1 3 17 4K 3

Inseq @InseqLib

2 years ago

@InseqLib v0.5 is finally out! 🐛 New tutorial, distributed and 4-bit quantized models, easier & better contrastive attribution, and more! 🎉 Thanks to @daniel_sc4 @peppeatta and all other contributors! Find out more in the release notes 👀 github.com/inseq-team/ins…