Tahmid Rahman @tahmedge

Senior Applied Scientist (NLP & ML) @ Dialpad · sites.google.com/view/tahmedge/… · Toronto, Canada · Joined November 2016

Tweets

937
Followers

213
Following

450
Likes

7K

Rohan Paul @rohanpaul_ai

a day ago

The paper shows LLM-as-a-judge is inconsistent, and proposes a probabilistic framework to fix it. It shows 2 failures: a lower-scored answer can win head-to-head, and pairwise picks can form loops. This matters so much because rankings, A/B tests, and reward modeling become…

2 4 14 4K 17

Tahmid Rahman @tahmedge

3 days ago

The submission deadline for the 2nd Workshop on Bangla Language Processing (BLP), co-located with AACL-IJCNLP 2025, has been extended to October 4, 2025 (blp-workshop.github.io) @firojalam04 @cryptexcode @Enamul_Hoque @shammur_absar @MdNishatRaihan @aaclmeeting

0 4 1 151 0

Rohan Paul @rohanpaul_ai

4 days ago

This paper introduces a single model that can reason across biology, chemistry, and materials science in one place. It was trained on 206B tokens that mix scientific text, raw sequences like DNA and proteins, and text-sequence pairs. Then it was tuned on 40M instructions and…

2 20 132 10K 92

Tahmid Rahman @tahmedge

5 days ago

I am glad to share the acceptance of 6 of my latest research papers at EMNLP 2025 (2 in the Main Track, 3 in the Industry Track, 1 in the NewSumm Workshop). Congrats to all my co-authors at Dialpad, York University, UofA, and NTU. #EMNLP2025

0 0 5 207 0

Zephyr @zephyr_z9

a week ago

BRUH QWEN 3 VL is a MONSTER Alibaba dropped the best VL model HOLY SHIT

16 48 574 52K 126

Rohan Paul @rohanpaul_ai

2 weeks ago

This chart shows how AI is expected to progress on biology benchmarks by 2030. The first curve, PoseBusters-v2, is about predicting protein-ligand interactions. AI systems are already reaching high accuracy here, suggesting these tasks could be solved in just a few years. The…

1 3 12 3K 3

Ridwan Mahbub @mahbub_ridwan

2 weeks ago

Excited to announce that our paper has been selected for a Best Paper Award at IEEE VIS 🏆 I would like to extend my gratitude to my co-authors, specifically to my supervisor Dr. @Enamul_Hoque . This achievement would not have been possible without their support. #IEEEVIS2025

Ridwan Mahbub @mahbub_ridwan

2 months ago

1 5 9 987 1

0 5 11 551 1

Brian Peterson @briandialpad

2 weeks ago

IMO, this is one of the best accomplishments we've ever pulled off at Dialpad. Not only do we have our own completely proprietary and 90%+ accurate customer satisfaction scoring AI model, but we now have explanations for those AI scores in detail and in aggregate. Now any company…

0 1 2 133 0

Biology+AI Daily @BiologyAIDaily

2 weeks ago

Evaluating Language Models for Biomedical Fact-Checking: A Benchmark Dataset for Cancer Variant Interpretation Verification 1. A new benchmark dataset called CIViC-Fact has been developed to evaluate the accuracy of language models in verifying cancer variant claims. This…

1 1 4 1K 3

Shubham Saboo @Saboo_Shubham_

2 weeks ago

China's Alibaba just dropped a Python framework for building multi-agent apps. AgentScope lets you build AI agents visually with MCP tools, memory, RAG, and reasoning capabilities. Works with any LLM and supports real-time steering. 100% open source.

50 354 2K 127K 2K

Shubham Saboo @Saboo_Shubham_

3 weeks ago

Train AI Agents for complex real-world tasks in just a single line of Python Code. Agent Reinforcement Trainer uses LLM-as-judge to train multi-step agents without manual rewards. 100% Opensource.

16 91 568 35K 673

Shubham Saboo @Saboo_Shubham_

4 weeks ago

This Google engineer just released a 424-page free book on Agentic Design Patterns. Covers advanced prompt engineering, multi-agent frameworks, RAG, agent tool use and MCP. 100% free with practical code examples.

27 342 2K 160K 4K

Biology+AI Daily @BiologyAIDaily

4 weeks ago

What Large Language Models Know About Plant Molecular Biology 1. A new benchmark called MOBIPLANT has been introduced to evaluate the capabilities of large language models (LLMs) in plant molecular biology. This benchmark was developed by a consortium of 112 plant scientists…

0 1 2 229 1

Biology+AI Daily @BiologyAIDaily

4 weeks ago

1 6 20 1K 6

Biology+AI Daily @BiologyAIDaily

4 weeks ago

Supervised learning in DNA neural networks @Nature 1. A groundbreaking study demonstrates that DNA molecules can autonomously perform supervised learning in vitro, a significant leap towards embedding learning capabilities in non-living systems. The research shows that DNA…

0 2 8 2K 4

Rohan Paul @rohanpaul_ai

4 weeks ago

How to generate medical training data and rewards that make small models generalize. A 7B model beats a 72B model by 19.7% on OmniMedVQA. The model reads medical images and text together like a vision language system. It creates its own image question answer tasks, then a…

0 5 18 3K 12

Rohan Paul @rohanpaul_ai

4 weeks ago

🧬 Massive. Newly released Biomni-R0, a tiny 8B param biomedical AI model surpasses Claude 4 Sonnet and GPT-5, demonstrating the efficiency of domain-specialized training. The model uses reinforcement learning to push a biomedical agent to expert level. Biomni-R0 comes in 8B…

Kexin Huang @KexinHuang5

4 weeks ago

20 68 319 67K 149

4 25 144 15K 94

Salesforce AI Research @SFResearch

4 weeks ago

🤖 Better LLM Agents for CRM Tasks: Tips and Tricks CRM tasks are tough for LLMs - even GPT-4o only solves <30% of tasks in our CRMArenaPro benchmark 😬 📝 Blog: sforce.co/4600cWT 💡 Key finding: Showing agents HOW to solve tasks (not just WHAT to solve) dramatically…