Background / Short-Bio
As of March 2024, I completed my PhD in Natural Language Processing under Prof. Naoaki Okazaki at the Tokyo Institute of Technology, where my research focused on modeling linguistic acceptability using language models and coarse sentence representations.
Previously, I was Group Manager for Natural Language Processing at Rakuten (2021–2023), leading a team of researchers and engineers. I spearheaded impactful initiatives across E-Commerce, Finance, and Travel domains—including a keyword ad bidding optimizer, and a knowledge graph-based item attribute extractor.
Before that, I worked at Amazon Japan (2019–2021) as a Sr. Technical Program Manager in the Applied Machine Learning group. There, I led cross-functional ML projects spanning 13 countries, contributed to the development of a Japanese tokenizer used across six business use cases, and spearheaded the development of a sequence-to-sequence ad generation system integrated with Google Ads.
Between 2017 and 2019, I was promoted to lead the NLP group at Rakuten, leading a cross-functional R&D team of researchers and engineers. I helped establish the Rakuten India research lab, launched a deep learning fraud detection platform for credit card transactions, and co-founded the AI Trainers initiative to mentor ML talent across the company.
Earlier, from 2014 to 2017, I worked as a Researcher at NEC Central Research Laboratories, where I developed efficient deep learning algorithms for NEC’s SX-ACE Supercomputer. My contributions led to up to 95% speed-up for state-of-the-art CNNs. During this time, I also completed a one-month research stint as a visiting scholar at UC Berkeley’s Swarm Lab, leading a collaborative IoT project focused on optimizing user decisions in vertical mobility systems (elevators, escalators, stairs).
I began my career at Accenture as an Associate Software Engineer in 2010, before transitioning into research through my M.Tech at IIT Delhi, where I worked on CGRA compilers under Prof. Anshul Kumar. My undergraduate degree is in Computer Science from TCET Indore, where I studied under Prof. Ritu Tandon.
My expertise spans NLP, deep learning, high-performance computing, and AI-driven business solutions. I’m passionate about translating academic research into real-world impact, particularly through building intelligent systems and driving applied innovation. For more, visit my research and academics pages.
Curious about my work? Check out my publications and patents here.