Ep52 Partha Talukdar
In this episode of ACM ByteCast, Bruke Kifle hosts Partha Talukdar, Senior Staff Research Scientist at Google Research India, where he leads a group focused on natural language processing (NLP), and an Associate Professor at the Indian Institute of Science (IISc) Bangalore. Partha was previously a postdoctoral fellow at Carnegie Mellon University’s Machine Learning Department and received his PhD in computer information science from the University of Pennsylvania. He is broadly interested in natural language processing, machine learning, and making language technologies more inclusive. Partha is a co-author of a book on graphs-based learning and the recipient of several awards, including the ACM India Early Career Researcher Award for combining deep scholarship of NLP, graphical knowledge representation, and machine learning to solve long-standing problems. He is also the founder of Kenome, an enterprise knowledge graph company with the mission to help enterprises make sense of big dark data.
Partha shares how exposure to language processing drew him to languages with limited resources and NLP. He and Bruke discuss the role of language in machine learning and whether current AI systems are merely memorizing and reproducing data or are actually capable of understanding. He also talks about his recent focus on inclusive and equitable language technology development through multilingual-multimodal Large Language Modeling, including Project Bindi. They discuss current limitations in machine learning in a world with more than 7,000 languages, as well as data scarcity and how knowledge graphs can mitigate this issue. Partha also shares his insights on balancing his time and priorities between industry and academia, recent breakthroughs that were impactful, and what he sees as key future achievements for language inclusion.