We've been collecting and mirroring what we can find of public data scrapes of data that has recently gone missing from federal sites or is likely to in the near future. The repos here include public data from CDC, NIH, and NOAA. Be warned that some of these repos are quite large!

https://git.lsit.ucsb.edu/publicdata

#datascience #cdc #nih #noaa

publicdata

Archives of Public Data Sets

Git for LSIT at UCSB
@vwbusguy Hey! I'm downloading the ERIC database right now; I didn't realize I was late to the party. How can I send it your way?

@ashtonandrepont Given that ERIC is not public domain, CC, etc., I probably can't host it here, unless you are only fetching the public domain articles.

https://eric.ed.gov/?copyright

ERIC - Content Disclaimers – Website and FAQs

ERIC is an online library of education research and information, sponsored by the Institute of Education Sciences (IES) of the U.S. Department of Education.

@ashtonandrepont Ah! I see the public data. I'll grab it shortly. Thanks for bringing this to my attention!
@vwbusguy Happy to help! o7
@ashtonandrepont If you happen to know of a way to search ERIC by license so I can isolate the public domain stuff from the copyrighted stuff, then I can grab more material, but I didn't see that in the search options.

@vwbusguy I don't see it either :/

I do have a list of all the URL links to the files if that'd be helpful.

DeptEd

Public Data from Department of Education and NCES.

Git for LSIT at UCSB
@vwbusguy Holy crap, you did that fast! It's taking me a whole day at this rate to download what I'm trying to download.