Below is a list of the top websites that provide datasets for machine learning projects:
🌐Christof Schöch, University of Trier, details how the #DOAJ journal #dataset is used to teach #Python programming for Machine Learning in a Digital Humanities Master's program @christof
#PythonProgramming #APCs #DataClassification #DataCleaning #MachineLearning
🔗 https://blog.doaj.org/2026/03/30/teaching-python-programming-with-doajs-journal-dataset/
Rohan Paul (@rohanpaul_ai)
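Introducing UnifoLM-WBT-Dataset, a new open-source robotics dataset released by Unitree Robotics. It contains high-quality whole-body teleoperation data collected in real-world environments and can be used for research and training on humanoid robot manipulation in open environments.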

New big open source robotic dataset from @UnitreeRobotics UnifoLM-WBT-Dataset - a high-quality dataset drawn from real-world settings for whole-body teleoperation of humanoid robots in open environments. Unitree says the dataset will grow to include broader scenarios and more
From signatures to ML IDS: what can the Suricata IDS teach a model?
[Text not for publication: I could not find how to attach a message for the Editors; this article was written as part of the blog of the "Ivannikov Institute for System Programming of the RAS"]
Danfei Xu (@danfei_xu)
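EgoVerse, an ecosystem for training robots on first-person (egocentric) human data, has been introduced. Four research labs and three industry partners took part, and it offers a large-scale dataset covering more than 1300 hours, 240 scenes, and more than 2000 tasks, along with research findings.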

Introducing EgoVerse: an ecosystem for robot learning from egocentric human data. Built and tested by 4 research labs + 3 industry partners, EgoVerse enables both science and scaling: 1300+ hrs, 240 scenes, 2000+ tasks, and growing. Dataset design, findings, and ecosystem 🧵
M.M. Sandin et al. (preprint, 2025) "inferred a timeline of #eukaryoteevolution using molecular clock and birth-death diversification models". They used a "#dataset of 75,975 non-redundant...#taxonomicunits and 77 well-supported fossil calibrations" and reconstructed an #evolutionary #diversification of #eukaryote #crowngroup representatives in the Proterozoic (ca. 2.5 billion to 541 million years ago).
StefanFWirth
Ref
https://doi.org/10.64898/2025.12.12.693929
Fig
M.M.Sandin et al.(2025), http://creativecommons.org/licenses/by-nc-nd/4.0/