Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
People
Join us!
Publications
Collaborators
Data Efficiency
SMART: Submodular Data Mixture Strategy for Instruction Tuning
Instruction Tuning involves finetuning a language model on a collection of instruction-formatted datasets in order to enhance the …
Kowndinya Renduchintala
,
Sumit Bhatia
,
Ganesh Ramakrishnan
PDF
Cite
Poster
Video
DOI
INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
A salient characteristic of pre-trained language models (PTLMs) is a remarkable improvement in their generalization capability and …
Kowndinya Renduchintala
,
Krishnateja Killamsetty
,
Sumit Bhatia
,
Milan Aggarwal
,
Ganesh Ramakrishnan
,
Rishabh Iyer
,
Balaji Krishnamurthy
PDF
Cite
Poster
Video
DOI
Cite
×