This gist will be a place for some interesting data source I found but have no time to collect/think about yet.
__author__ = "Zhiya Zuo"
__email__ = "[email protected]"
This section lists datasets to play with or experiment with (e.g., test/evaluate/validate/learn new algorithms.)
A list of classic network datasets compiled by Prof. Mark Newman at the University of Michigan.
Specifically for machine learning and data mining purposes. Has some very classic dataset to play with.
Yelp has a lot of datasets for text mining, network analysis, etc.
Alpha Vantage APIs are grouped into three categories: (1) Time Series Data, (2) Technical Indicators, and (3) Sector Performances. All APIs are realtime: the latest data points are derived from the current trading day. Examples in this documentation are for demo purposes.
-Value Viz Free for 10 requests per hour.
Academic genealogy dataset. Probabily the one with the best coverage is the one of [neuroscience] (https://neurotree.org/neurotree/). Can I scrape it or just ask for the data?
This one is specifically for Math.
I have been scraping the beer ratings but have not checked out their social network. Seems to be an interesting dataset.
This one has many interesting dataset (including salary) but needs subscription (UIowa does have it).
This one can be utilized to find Iowa public univerity faculty salaries (or maybe local government) with appropriate queries in the HTTP request.
Public universities in Illinois. Actually the website has many other data including enrollment and degree program but it is harder to scrape.
The 2017 iteration of the survey includes information on salary and benefits for more than 380,000 faculty members from 1,022 institutions in all 50 states and Puerto Rico.