Our collection of open source datasets for medical AI

Chris Hilger
December 3, 2020

We live and breathe medical datasets for AI!

Open source data can be very valuable when you start planning a medical data labeling pipeline (more on that in our free white paper).

We thought it would be helpful to organize some of our favorite open source datasets into a list and share them with the community.

In our list, you can explore dozens of datasets by size, category, modality (including X-ray, ultrasound, whole slide images, CT scans and ECGs) and more. We have also included a brief description of each dataset to help you quickly understand the specific abnormalities of interest, the balance of the data, and the annotations included, such as medical image classifications or segmentations.

A screenshot of our table containing dozens of open source medical datasets

Access the full collection here.

If you know of any datasets that should be added to this list, please let us know!

