Users of NCHS public-use data files must comply with data use restrictions to ensure that the information will be used solely for statistical analysis or reporting purposes. National Health and Nutrition Examination Survey (NHANES) Data Sets and Related Documentation. National Health Care Surveys. National Ambulatory Medical Care Survey (NAMCS). Kent Ridge Bio-medical Dataset. This is an online repository of high-dimentional biomedical data sets, including gene expression data, protein profiling data and genomic sequence data that are related to classification and that are published recently in Science, Nature and so on prestigious journals. Available Datasets **. Click here to download the data. OCT data & Color Fundus Images of Left & Right Eyes of 50 healthy persons: This dataset contains OCT data (in mat format) and color fundus data (in jpg format) of left & right eyes of 50 healthy persons. Data Set 1 Data Set2.
A few data sets are accessible from our data science apprenticeship web page.
- Source code and data for our Big Data keyword correlation API (see also section in separate chapter, in our book)
- Great statistical analysis: forecasting meteorite hits (see also section in separate chapter, in our book)
- Fast clustering algorithms for massive datasets (see also section in separate chapter, in our book)
- 53.5 billion clicks dataset available for benchmarking and testing
- Over 5,000,000 financial, economic and social datasets
- New pattern to predict stock prices, multiplies return by factor 5 (stock market data, S&P 500; see also section in separate chapter, in our book)
- 3.5 billion web pages: The graph has been extracted from the Common Crawl 2012 web corpus and covers 3.5 billion web pages and 128 billion hyperlinks between these pages
- Another large data set - 250 million data points: This is the full resolution GDELT event dataset running January 1, 1979 through March 31, 2013 and containing all data fields for each event record.
- 125 Years of Public Health Data Available for Download
You can find additional data sets at the Harvard University Data Science website. I was particularly interested in their LinkedIn data set. KDNuggets is also a great resource, and for more, check out this link.
Medical Data Sets Free Download 2017
Cross-disciplinary data repositories, data collections and data search engines:
- http://thedatahub.org alias http://ckan.net
- Social Network Analysis Interactive Dataset Library (Social Network Datasets)
Single datasets and data repositories
Free Sample Data Sets
- https://pslcdatashop.web.cmu.edu/ (interaction data in learning environments)
- http://www.icpsr.umich.edu/icpsrweb/CPES/ - Collaborative Psychiatric Epidemiology Surveys: (A collection of three national surveys focused on each of the major ethnic groups to study psychiatric illnesses and health services use)