We've been collecting and mirroring what we can find of public data scrapes of data that has recently gone missing from federal sites or is likely to in the near future. The repos here include public data from CDC, NIH, and NOAA. Be warned that some of these repos are quite large!

https://git.lsit.ucsb.edu/publicdata

#datascience #cdc #nih #noaa

publicdata

Archives of Public Data Sets

Git for LSIT at UCSB
@vwbusguy Hey! I'm downloading the ERIC database right now; I didn't realize I was late to the party. How can I send it your way?

@ashtonandrepont Given that ERIC is not public domain, CC, etc., I probably can't host it here, unless you are only fetching the public domain articles.

https://eric.ed.gov/?copyright

ERIC - Content Disclaimers – Website and FAQs

ERIC is an online library of education research and information, sponsored by the Institute of Education Sciences (IES) of the U.S. Department of Education.

@vwbusguy Ah, understandable. Is there a good place for me to put it?
@ashtonandrepont I am not a lawyer, but the Internet Archive might be a possibility.
@vwbusguy Me neither, but I was planning on uploading it there when I get them all downloaded.