By training a model on the allowed “words” and “grammar” of chemistry, Georgia Tech materials scientists can design polymers based on the properties users need.
Researchers have created chemical language AI models to generate new polymer structures based on the properties those polymers need to exhibit. Led by Rampi Ramprasad, standing, the team included postdoctoral scholar Wei Xiong, Ph.D. student Anagha Savit, and research scientist Harikrishna Sahu, who are seated left to right. (Photo: Candler Hobbs)
The words on this page mean something because they are assembled in a particular order and follow the complex rules of grammar and syntax. Creating new chemical polymers follows a similar kind of structure, with rules about what elements and groups of atoms go together and how to assemble them to make sense.
Thinking about polymers in that way has led Georgia Tech materials scientists to create new generative artificial intelligence tools that are like Claude or ChatGPT for new materials.
These are the first foundational models for generative polymer design that have also been validated through physical experiments: users specify the properties they need in a polymer and the model will suggest a chemical structure.
Led by Regents’ Entrepreneur Rampi Ramprasad, the researchers described their latest model this month in the Nature journal npj Artificial Intelligence — including a test material they created and validated in the lab to prove the models work.
“This architecture learns the chemical semantics and chemical grammar. It learns what is allowed, what is not allowed, what comes together well, and what makes a good chemical sentence,” said Ramprasad, Michael E. Tennenbaum Family Chair and professor in the School of Materials Science and Engineering. “A word is really defined by the neighbors it keeps. That's the context for the meaning of a word. It’s the same with atoms or clusters of atoms.”
The new Georgia Tech tool, called POLYT5, overcomes barriers that have limited previous generative AI approaches to polymer design: they sometimes suggested polymers that failed to follow the rules of chemistry or couldn’t be created in real-world labs.
“The chemical structures POLYT5 generates are 100% robust and will follow chemical grammar and semantics,” Ramprasad said. And because the model is trained only on polymers that can realistically be made, its ideas are more likely to be usable.
To create POLYT5 and a related tool called polyBART, Ramprasad, Research Scientist Harikrishna Sahu, postdoctoral scholar Wei Xiong, and Ph.D. students Anagha Savit and Shivank Shukla started with existing AI language architectures. They stripped out the natural language training and instead taught the model using polymer chemical structures — the words and sentences of chemical language. They used more than 12,000 experimentally produced polymers from research studies, plus a bank of over 100 million hypothetical candidates.
(text and background only visible when logged in)
“This is a different kind of language — the language these atoms follow. I've worked in this field for a long time, but even five years ago, I would not have thought that language models could be adapted to learn chemistry.”
RAMPI RAMPRASAD
Michael E. Tennenbaum Family Chair and Professor
To test the new models, the research team asked them to suggest designs for materials called polymer dielectrics, which are used in electric vehicles, defibrillators, and other places requiring a quick burst of energy. The team outlined a series of specific properties that would result in good performance at high temperatures and allow the material to be processed industrially. The researchers picked one of the top candidates from each model’s output and created it in the lab.
Their testing showed they performed just as expected.
Though Ramprasad and his team fine-tuned the base model for dielectrics in the study, the idea is that it can be tuned for any combination of properties. In fact, his group is doing exactly that, teaching the model to design polymers for very different uses as part of new projects underway in his lab.
In the npj Artificial Intelligence study, the researchers also paired POLYT5 with a general-purpose large-language model. The goal is to make it possible for more scientists to use it without being experts in polymer design and creation.
“We live in a world where people want to be able to converse with these agents, and we ultimately want to decrease the barrier for adoption of these tools,” Ramprasad said.
POLYT5 and polyBART build on years of previous work in Ramprasad’s lab. His team has built tools to predict properties of potential polymers to speed discovery of new materials. The new model offers the inverse: using the desired properties to design the polymer from the beginning.
“Language, they say, is the frontier of intelligence. And this is a different kind of language — the language these atoms follow,” Ramprasad said. “I've worked in this field for a long time, but even five years ago, I would not have thought that language models could be adapted to learn chemistry.”
About the Research
This research was supported by the Office of Naval Research, grant Nos. N00014-19-1-2103 and N00014-20-1-2175. Computations were performed at the San Diego Supercomputing Center through allocation DMR080044 from the Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support program. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of any funding agency.
Citations:
Sahu, H., Xiong, W., Savit, A. et al. POLYT5: an encoder-decoder foundation chemical language model for generative polymer design. npj Artif. Intell. 2, 30 (2026). https://doi.org/10.1038/s44387-026-00087-1
Savit, A. et al. polyBART: A Chemical Linguist for Polymer Property Prediction and Generative Design. Findings of the Association for Computational Linguistics: EMNLP 2025. https://doi.org/10.18653/v1/2025.findings-emnlp.647
Preeminence in Research
Related Content
Using AI to Find the Polymers of the Future
A pair of papers from MSE's Rampi Ramprasad describes advances in polymer design for energy storage, filtration technologies, and recyclable plastics.
Georgia Tech Research News
A pair of papers from MSE's Rampi Ramprasad describes advances in polymer design for energy storage, filtration technologies, and recyclable plastics.
New Polymer Membranes, AI Predictions Could Dramatically Reduce Energy, Water Use in Oil Refining
The membranes would improve distillation processes that account for 1% of the world’s energy use.
The membranes would improve distillation processes that account for 1% of the world’s energy use.
Algorithm Learns Chemical Language to Accelerate Polymer Research
MSE researchers developed a machine learning model to cut down on the time-consuming effort to identify new combinations of materials that could be effective polymers.
Georgia Tech News Center
MSE researchers developed a machine learning model to cut down on the time-consuming effort to identify new combinations of materials that could be effective polymers.