Tuesday 29 June 2021

Machine learning dataset and repositories

 It is always good idea to practice and learn with real dataset. There are many websites that publishes open datasets for training different models. There are few websites that list open data repositories. These are called Meta portals. Below are the links to the repositories. Make sure to check the data size before downloading. Some datasets are really huge which will take considerable amount of time and internet bandwidth to download.

Data repositories:

  • https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research
  • http://archive.ics.uci.edu/ml/index.php
  • https://registry.opendata.aws/
  • https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
  • https://www.kaggle.com/datasets
  • http://dataportals.org/
  • https://opendatamonitor.eu/frontend/web/index.php?r=dashboard%2Findex
  • https://www.quandl.com/

No comments:

Post a Comment

Note: only a member of this blog may post a comment.

Content Generation Using Google Blogger, Python and llama2

In this video, I dive into the world of AI and ML to showcase a fascinating tool called llama2. Using Python, I demonstrate how to generate ...