Hexagonal Shapes superimposed over a supercomputer server room

 

Welcome to IDEaS

 

The Institute for Data Engineering and Science (IDEaS) provides a unified point to connect government, industry, and academia to advance foundational research, and accelerate the adoption of Big Data technology. IDEaS leverages expertise and resources from throughout Georgia Tech's colleges, research labs, and external partners, to define and pursue grand challenges in data science foundations and in data-driven discovery. We are also dedicated to educating students and those already in the workforce through innovative educational and training programs.

 

 

 

Spotlight

 


Foundations of Artificial Intelligence Seminar Series Kickoff!

 

Integrated Systems for Computational Scientific Discovery: Progress, Challenges, and Implications

 

 Dr. Pat Langley, Georgia Tech Research Institute (GTRI)
January 30, 2026 | 4pm - 5pm  |Classroom 380, Bunger-Henry Building & online via Zoom

 

Abstract: There has been a steady stream of AI work on scientific discovery since the 1970s, much of it leading to published results in fields like astronomy, biology, chemistry, and physics. However, most efforts have focused on isolated tasks rather than addressing their interaction. In this talk, I challenge the research community to develop and adopt integrated discovery systems. I note distinguishing features of scientific discovery and examine five component abilities, in each case specifying the problem and reviewing results in the area. After this, I note some successes at partial integration and consider some remaining hurdles that we must leap to transform the vision for integrated discovery into reality. I also discuss promising domains, natural and synthetic, in which to test such computational artifacts. In closing, I consider ways that integrated discovery can aid the scientific enterprise and factors that influence whether results are trustworthy.

 
Data Sorting Graphic

 

Institute for Data Engineering and Science (IDEaS) Executive Director Search Finalist Visits

 

Four finalists have been chosen for the role of Executive Director of the Institute for Data Engineering and Science (IDEaS). Each finalist will meet with Georgia Tech faculty, staff, and IRI leadership and give a seminar on their vision for the future of IDEaS.

 

 

Centers


 

Close up image of the interior of a high performance computing system
 
Center for High Performance Computing
 

The Center for High Performance Computing 
(CHiPC) advances the state of the art in massive data and high-performance computing technology, and solves high-impact real-world problems. HPC scientists devise computing solutions at the absolute limits of scale and speed. In this compelling field, technical knowledge and ingenuity combine to drive systems using the largest number of processors at the fastest speeds with the least amount of storage and energy. The center's focus is primarily on algorithms and applications. 

"Graphic of connected pins in blue and gold"
 
The Center for Artificial Intelligence in Science and Engineering (ARTISAN)

 

The Center for Artificial Intelligence in Science and Engineering (ARTISAN) aims to accelerate advances in science and engineering by integrating cutting-edge artificial intelligence techniques. We are dedicated to fostering interdisciplinary research, cultivating the next generation of AI experts, and developing innovative solutions that address complex challenges in our world. 

AI generated image of a book with gold glowing edges on a black background

 

The South Big Data Innovation Hub

 

Georgia Tech, along with the University of North Carolina’s Renaissance Computing Institute (RENCI), co-directs the South Big Data Regional Innovation Hub that serves 16 Southern states and the District of Columbia. It is part of the National Science Foundation’s four Regional Innovation Hubs, created to build innovative public-private partnerships addressing regional challenges from data analysis and research to data science workforce development. The Georgia Tech location is operationally run as a center of the Institute for Data Science and Engineering.

Featured Research Areas


 

Machine Learning

Machine Learning

Unstructured and dynamic data analysis, deep learning, data mining, and interactive ML underpin big data foundations and applications.

Learn More »

Health & Life Sciences

Health & Life Sciences

Driving predictive, preventive, & personalized care using big data sets from genomics, systems biology, proteomics, and health records.

Learn More »

High Performance Computing

High Performance Computing

High-performance systems, middleware, algorithms, applications, software, and frameworks for data-driven computing.

Learn More »

Materials & Manufacturing

Materials & Manufacturing

Microscopic views of materials and scalable modeling and simulation technologies for accelerated development of new materials.

Learn More »

Energy Infrastructure

Energy Infrastructure

Sensors and Internet of Things enable infrastructure monitoring. Data analytics improves energy production, transmission, distribution, and utilization.

Learn More »

Algorithms & Optimization

Algorithms & Optimization

Streaming and sublinear algorithms, sampling and sketching techniques, high-dimensional analysis for big data analytics.

Learn More »