Publications

Papers are roughly grouped by topics and sorted based on date. The full list of articles is available on my Google Scholar.

Vision-Language Adapters

R-MMA: Enhancing Vision-Language Models with Recurrent Adapters for Few-Shot and Cross-Domain Generalization
WACV'26
Md Fahim*, Farhan Ishmam*, Mir Sazzat Hossain, M Ashraful Amin, Amin Ahsan Ali, AKM Mahbubur Rahman
Paper

Visual Question Answering (VQA)

Enhancing Vision Language Corruption Robustness using Cross Distribution & Prompted Denoisers
WACV'26
Sameer Shafayet Latif*, Sadab Shiper*, K. M. Rahiduzzaman Kiran*, Md Farhan Ishmam*, Md Azam Hossain, Abu Raihan Mostofa Kamal, Md Hamjajul Ashmafee
Paper Code
BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts
WACV'26
Md Fahim*, Md Sakib Ul Rahman*, Akm Moshiur Rahman*, >Md Farhan Ishmam*, Md Tasmim Rahman, Fariha Tanjim Shifat, Fabiha Haider, Md Farhad Alam Bhuiyan
Paper Code
Visual Robustness Benchmark for Visual Question Answering (VQA)
WACV'25
Md Farhan Ishmam*, Ishmam Tashdeed*, Talukder Asir Saadat*, Md Hamjajul Ashmafee, Abu Raihan Mostofa Kamal, Md Azam Hossain
Paper Supp Code Poster Slides Video
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla
ECML-PKDD'25
Deeparghya Dutta Barua*, Md Sakib Ul Rahman Sourove*, Md Farhan Ishmam*, Fabiha Haider, Fariha Tanjim Shifat, Md Fahim, Farhad Alam Bhuiyan
ArXiv Code
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
Information Fusion Journal | 2024
Md Farhan Ishmam, Md Sakib Hossain Shovon, Muhammad Firoz Mridha, Nilanjan Dey
Paper ArXiv

Transliteration and Code-Mixing

BanglaTLit: A Benchmark Dataset for Back-Transliteration of Romanized Bangla
EMNLP'24 Findings
Md Fahim*, Fariha Tanjim Shifat*, Fabiha Haider*, Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Md Farhan Ishmam, Farhad Alam Bhuiyan
Paper Code Dataset Poster Slides Video
BanTH: A Multi-label Hate Speech Detection Dataset for Transliterated Bangla
NAACL'25 Findings
Fabiha Haider*, Fariha Tanjim Shifat*, Md Farhan Ishmam*, Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Md Fahim, Farhad Alam Bhuiyan
ArVix Code
BnSentMix: A Diverse Bengali-English Code-Mixed Dataset for Sentiment Analysis
LoResLM@COLING'25
Sadia Alam, Md Farhan Ishmam, Navid Hasin Alvee, Md Shahnewaz Siddique, Md Azam Hossain, Abu Raihan Mostofa Kamal
Paper Code Dataset Poster Slides

Harmful Content

Penta NLP at EXIST 2024 Task 1–3: Sexism Identification, Source Intention, Sexism Categorization In Tweets
EXIST@CLEF'24
Fariha Tanjim Shifat, Fabiha Haider, Md Sakib Ul Rahman Sourove, Deeparghya Dutta Barua, Md Farhan Ishmam, Md Fahim, Farhad Alam Bhuiyan
Paper Code
Penta ML at EXIST 2024: Tagging Sexism in Online Multimodal Content With Attention-enhanced Modal Context
EXIST@CLEF'24
Deeparghya Dutta Barua, Md Sakib Ul Rahman Sourove, Fabiha Haider, Fariha Tanjim Shifat, Md Farhan Ishmam, Md Fahim, Farhad Alam Bhuiyan
Paper Code

General Text Classification

FourierKAN outperforms MLP on Text Classification Head Fine-tuning
FITML@NeurIPS'24
Abdullah Al Imran*, Md Farhan Ishmam*
Paper Code