AI Projects
Explore Jarvis Voice Assistant, IntelliBrowser, Dynamic Agentic RAG, and other systems connecting multimodal, agentic, and retrieval research to working software.
Research & Publications
Research on multimodal evaluation and multimodal reasoning including "Re:Verse - Can Your VLM Read a Manga?" - Best Paper Award, Oral Presentation, Top 5%, ICCV AIstory 2025 Workshop.
Blog
Insights on AI architecture, VLM optimization, agentic workflows, and the future of intelligent software.
Contact Madhav Kataria
Get in touch for multimodal evaluation, benchmark design, multimodal reasoning, and applied AI collaborations.
Contact Me