Publications

Papers are roughly grouped by topic and sorted by date. The full list of articles is available on my Google Scholar.

Agentic Systems

TimeWarp paper figure
TimeWarp: Evaluating Web Agents by Revisiting the Past
ArXiv'26
Md Farhan Ishmam, Kenneth Marino
Project Page Paper Code Dataset Models
Computer Use Survey paper figure
Computer Use Survey - A Visual Survey of Computer Use Agents
ICLR'26 Blogposts
Kenneth Marino, Md Farhan Ishmam, Ana Marasovic
Blog

Vision-Language Adapters

R-MMA paper figure
R-MMA: Enhancing Vision-Language Models with Recurrent Adapters for Few-Shot and Cross-Domain Generalization
WACV'26
Md Fahim*, Md Farhan Ishmam*, Mir Sazzat Hossain, M Ashraful Amin, Amin Ahsan Ali, AKM Mahbubur Rahman
Paper Supp Code Poster

Prompt Compression

FrugalPrompt prompt compression paper figure
FrugalPrompt: Reducing Contextual Overhead in Large Language Models via Token Attribution
ArXiv'26
Syed Rifat Raiyan, Md Farhan Ishmam, Abdullah Al Imran, Mohammad Ali Moni
Project Page Paper Code

Visual Question Answering (VQA)

Vision-language corruption robustness paper figure
Enhancing Vision Language Corruption Robustness using Cross Distribution & Prompted Denoisers
WACV'26
Sameer Shafayet Latif*, Sadab Shiper*, K. M. Rahiduzzaman Kiran*, Md Farhan Ishmam*, Md Azam Hossain, Abu Raihan Mostofa Kamal, Md Hamjajul Ashmafee
Paper Supp Code Poster Video
BanglaProtha dataset paper figure
BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts
WACV'26
Md Fahim*, Md Sakib Ul Rahman*, Akm Moshiur Rahman*, Md Farhan Ishmam*, Md Tasmim Rahman, Fariha Tanjim Shifat, Fabiha Haider, Md Farhad Alam Bhuiyan
Paper Supp Code Dataset Poster
Visual Robustness Benchmark for VQA paper figure
Visual Robustness Benchmark for Visual Question Answering (VQA)
WACV'25
Md Farhan Ishmam*, Ishmam Tashdeed*, Talukder Asir Saadat*, Md Hamjajul Ashmafee, Abu Raihan Mostofa Kamal, Md Azam Hossain
Paper Supp Code Poster Slides Video
ChitroJera VQA dataset paper figure
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla
ECML-PKDD'25
Deeparghya Dutta Barua*, Md Sakib Ul Rahman Sourove*, Md Farhan Ishmam*, Fabiha Haider, Fariha Tanjim Shifat, Md Fahim, Farhad Alam Bhuiyan
Paper Code Dataset Poster Slides
From Image to Language VQA survey paper figure
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Information Fusion Journal | 2024
Md Farhan Ishmam, Md Sakib Hossain Shovon, Muhammad Firoz Mridha, Nilanjan Dey
Paper ArXiv

Transliteration and Code-Mixing

Robustness of LLMs to Transliteration Perturbations in Bangla paper figure
Robustness of LLMs to Transliteration Perturbations in Bangla
🏆 Best Paper | BLP@IJCNLP-AACL'25
Fabiha Haider, Md Farhan Ishmam, Fariha Tanjim Shifat, Md Tasmim Rahman, Md Fahim, Md Farhad Alam Bhuiyan
Paper Code Poster Slides Video
BanglaTLit back-transliteration dataset paper figure
BanglaTLit: A Benchmark Dataset for Back-Transliteration of Romanized Bangla
EMNLP'24 Findings
Md Fahim*, Fariha Tanjim Shifat*, Fabiha Haider*, Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Md Farhan Ishmam, Farhad Alam Bhuiyan
Paper Code Dataset Poster Slides Video
BanTH hate speech dataset paper figure
BanTH: A Multi-label Hate Speech Detection Dataset for Transliterated Bangla
NAACL'25 Findings
Fabiha Haider*, Fariha Tanjim Shifat*, Md Farhan Ishmam*, Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Md Fahim, Farhad Alam Bhuiyan
ArXiv Code
BnSentMix sentiment analysis dataset paper figure
BnSentMix: A Diverse Bengali-English Code-Mixed Dataset for Sentiment Analysis
LoResLM@COLING'25
Sadia Alam, Md Farhan Ishmam, Navid Hasin Alvee, Md Shahnewaz Siddique, Md Azam Hossain, Abu Raihan Mostofa Kamal
Paper Code Dataset Poster Slides

Accessibility Technology

Prompting with Sign Language paper figure
Prompting with Sign Parameters for Low-resource Sign Language Instruction Generation
CV4A11y@ICCV'25
Md. Tariquzzaman, Md Farhan Ishmam, Saiyma Sittul Muna, Md Kamrul Hasan, Hasan Mahmud
Paper Code Dataset Video

General Text Classification

FourierKAN text classification paper figure
FourierKAN outperforms MLP on Text Classification Head Fine-tuning
FITML@NeurIPS'24
Abdullah Al Imran*, Md Farhan Ishmam*
Paper Code

Harmful Content

Penta NLP at EXIST 2024 paper figure
Penta NLP at EXIST 2024 Task 1–3: Sexism Identification, Source Intention, Sexism Categorization In Tweets
EXIST@CLEF'24
Fariha Tanjim Shifat, Fabiha Haider, Md Sakib Ul Rahman Sourove, Deeparghya Dutta Barua, Md Farhan Ishmam, Md Fahim, Farhad Alam Bhuiyan
Paper Code
Penta ML at EXIST 2024 paper figure
Penta ML at EXIST 2024: Tagging Sexism in Online Multimodal Content With Attention-enhanced Modal Context
EXIST@CLEF'24
Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Fabiha Haider, Fariha Tanjim Shifat, Md Farhan Ishmam, Md Fahim, Farhad Alam Bhuiyan
Paper Code