On April 13, 2022, Dr. Robert Grossman, Director of the Center for Translational Data Science, gave an academic keynote talk at the Commonwealth Computational Summit. This was the 5th annual summit hosted by the University of Kentucky’s Center for Computational Science.
Talk Title: The Data Gap in Machine Learning and AI: Why Much of Machine Learning and AI is Still Data Limited, and Some of the Options Available.
Abstract: Although large amounts of online text, images and audio have provided enough data that deep learning models can be developed that significantly improve language translation, image recognition, speech recognition and related applications, developing and deploying machine learning and AI models that provide value and limit bias is still quite difficult in many application areas due to the lack of suitable data. This is especially the case in biology, medicine and health care. We discuss some of the reasons that many important AI problems are still data-limited and some of the approaches that have been taken to address this challenge. We use case studies from machine learning models in COVID-19 and cancer to illustrate some of the challenges and some of the options available.