Tuesday 29 June 2021

Machine learning datasets and repositories

It is always a good idea to practice and learn with real datasets. Many websites publish open datasets for training different models, and a few websites, called meta portals, list open data repositories. Below are the links to the repositories. Make sure to check the data size before downloading: some datasets are very large and take a considerable amount of time and internet bandwidth to download.

Data repositories:

  • https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research
  • http://archive.ics.uci.edu/ml/index.php
  • https://registry.opendata.aws/
  • https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
  • https://www.kaggle.com/datasets
  • http://dataportals.org/
  • https://opendatamonitor.eu/frontend/web/index.php?r=dashboard%2Findex
  • https://www.quandl.com/
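
One way to check a dataset's size before downloading is to ask the server for the file's headers only. The snippet below is a minimal sketch using Python's standard library. It assumes you have a direct link to the file and that the server answers HEAD requests with a Content-Length header, which not every site does. The function names remote_size and human_readable are made up for this illustration.

```python
import urllib.request
from typing import Optional


def human_readable(num_bytes: float) -> str:
    """Format a byte count using binary units (1 KiB = 1024 bytes)."""
    for unit in ("B", "KiB", "MiB", "GiB"):
        if num_bytes < 1024:
            return f"{num_bytes:.1f} {unit}"
        num_bytes /= 1024
    return f"{num_bytes:.1f} TiB"


def remote_size(url: str, timeout: float = 10.0) -> Optional[int]:
    """Return the size in bytes reported by the server, or None if not reported.

    Sends a HEAD request, so only the headers are transferred, not the file.
    """
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        length = response.headers.get("Content-Length")
    return int(length) if length is not None else None
```

To use it, call remote_size with the direct link to a dataset file and pass the result to human_readable. If remote_size returns None, the server did not report a size, and the download page is usually the next place to look.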

Statistical Learning

Statistics is the study and manipulation of data, including ways to gather, analyze and draw conclusions from data. Statistical learning, ak...