Course Project (CSE 576 – NLP) · Jan 2025 - May 2025Optimizing Query Execution in Table QA
A multimodal Table Question Answering pipeline that decomposes questions into modality-specific sub-questions, filters irrelevant images using CLIP, and generates structured tables with captions to achieve 73.94% accuracy on the MultiModalQA dataset—outperforming all existing baselines.
PythonNLPMultimodal QAGemini APICLIP