https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
Name | Beschreibung | URL | Protokoll + Format | Beispieldatenelement | |||||
---|---|---|---|---|---|---|---|---|---|
Meetup.com RSVP Stream | Zusagen zu einem öffentlichem Event | Live-View des Datenstroms: http://meetup.github.io/stream/rsvpTicker/ | JSON über Websocket oder Chunked HTTP |
| |||||
Transport for London Data Feed | ÖPNV-Daten
| https://tfl.gov.uk/info-for/open-data-users/data-feeds?intcmp=29422 | |||||||
CLEF-Newsreel | Aufrufe von Nachrichtenartikeln und Aufforderungen, Empfehlungen zu geben | http://www.clef-newsreel.org/ |
JSON über HTTP-Post | |||||||||||
NWS Public Alerts | Wetterbenachrichtigungen | http://alerts.weather.gov/ | XML/CAP and ATOM Format | ||||||||
Yahoo Query Language | YQL Web Service | https://developer.yahoo.com/yql/guide/ | XML/JSON | https://developer.yahoo.com/yql/console/ | |||||||
Kaggle | Public datasets hosted by Kaggle | https://www.kaggle.com/datasets | |||||||||
Wikidata | Free linked database of wikipedia data | https://www.wikidata.org/wiki/Wikidata:Main_Page https://www.mediawiki.org/wiki/Wikibase/API | JSON/XML |
| |||||||
Intel Lab Data | Sensor measurements from 54 sensors deployed in the Intel Berkeley Research lab | http://db.csail.mit.edu/labdata/labdata.html | CSV | ||||||||
UC Irvine Machine Learning Repository | Repository with 335 datasets for the the machine learning community | https://archive.ics.uci.edu/ml/ | CSV | ||||||||
Amazon | Different datasets hosted by Amazon AWS | https://aws.amazon.com/public-data-sets/ | |||||||||
Different datasets hosted by Google | http://www.google.com/publicdata/directory#! | ||||||||||
KDD Cup | Datasets used for the annual Data Mining and Knowledge Discovery competition eorganized by ACM Special Interest Group on Knowledge Discovery and Data Mining | http://www.kdd.org/kdd-cup | CSV | ||||||||
MarineCadastre | AIS data from the US coast as GPS trajectories | http://marinecadastre.gov/ais/ | GDB, can be exported to CSV via QGIS (http://www.qgis.org/de/site/) |
| |||||||
GeoLife | Peoples trajectory data from social networks (GPS measurements of their movement) | https://www.microsoft.com/en-us/download/details.aspx?id=52367 | CSV |
| |||||||
Udacity Self Driving Car | 223GB of driving data with location (lat/lng), gear, break, throttle, steering angle, speed and image | https://github.com/udacity/self-driving-car https://medium.com/udacity/open-sourcing-223gb-of-mountain-view-driving-data-f6b5593fbfa5#.pe7j0hi8f | CSV and images |
| |||||||
GDELT | Geo-Referenced data that includes social happenings such as protests, violence reports, etc. Newest version updates every 15 minutes. | http://gdeltproject.org/ | CSV | ||||||||
Sloan Digital Sky Survey | Data from Apache Point Observatory, New Mexico | http://www.sdss.org | |||||||||
National Renewable Energy Laboratory | Datasets from wind power plants in North America | http://www.nrel.gov/electricity/transmission/western_wind_disclaimer.html | |||||||||
OpenEI | Open Energy Information: buildings, geothermal, hydrogen, smart grid, solar, utilities, water, wind | ||||||||||
T-Drive trajectory data | Data from taxis → trajectories | https://www.microsoft.com/en-us/research/publication/t-drive-trajectory-data-sample/ | CSV |
|