Хороший, плохой, злой: База данных, data catalog и AI

Всех приветствую! Меня зовут Павел, работаю в компании Lasmart. Одно из направлений деятельности всегда было внедрение и развитие DWH. В какой-то момент задумались о том, чтобы оптимизировать прежде всего свою работу в некоторых аспектах. И первым инструментом сделали генерацию бизнес-описания на основе AI. Назвали Datadesc (data + description). Об этом опыте и пойдет речь в этой статье.

https://habr.com/ru/articles/996288/

#dwh #sql #data_catalog #openmetadata #datahub #data_engineering #data_analyst #semantic #arenadata_catalog #ai

Хороший, плохой, злой: База данных, data catalog и AI

Всех приветствую! Меня зовут Павел, работаю в компании Lasmart. Одно из направлений деятельности всегда было внедрение и развитие DWH. В какой-то момент задумались о том, чтобы оптимизировать прежде...

Хабр

DataCite’s public data file is now available in the Dimensions BigQuery Lab through a partnership with Digital Science! Explore & analyze metadata for 100M+ research outputs and resources—open by default and free to access on Google BigQuery. https://doi.org/10.5438/p8kv-df04

#OpenMetadata #OpenInfrastructure

📚✨ Now live: What does the FIL Guadalajara reveal about the circulation of academic books?

This post reflects on conversations from the fair and connects them to the role of Thoth Open Metadata and SciELO Books in enabling the discovery, circulation, and long-term sustainability of open access academic books.
🔗 https://copim.pub/fil-guadalajara-academic-books-thoth-open-metadata-scielo-livros/

#OpenAccess #OpenMetadata #OpenInfrastructure #OAbooks

What the FIL Guadalajara debates reveal about metadata for academic books and how Thoth Open Metadata and SciELO Books respond to this challenge - Copim

In early December, FIL Guadalajara 2025 took place, one of the main international events in the Latin American publishing sector. Among the various profession…

Copim

Sitting without electricity and heating, but still thinking about… open #bibliometric data. Sharing our presentation from Bergen 2025. Even in these conditions, we keep building resilient research infrastructures:

👉 https://doi.org/10.6084/m9.figshare.30753224

The bibliometrics market is a textbook case of market failure: monopolies dominate, national research stays invisible, and profit beats #data quality. That’s why national infrastructures and #openmetadata really matter.

#OpenScience #OpenData #SciencePolicy

Highlights from the post:
✨ funding statements alone aren’t sufficient
🔎 enables analysis & verification across outputs
🏗️ infrastructure matters (e.g. Crossref Grant IDs) + stronger workflows
🔗 clear roles for funders, publishers & infrastructure providers

#OpenResearchInformation #FundingMetadata #OpenMetadata #ResearchTransparency #OpenScience

New blog post on how Wellcome and Europe PMC are using the Crossref Grant Linking System to improve funding transparency and reduce reporting burden through open metadata. https://doi.org/10.64000/c1dh8-qn968

#OpenResearch #OpenMetadata #OpenInfrastructure

Join @Thoth_metadata @PublicKnowledgeProject at the 20th Munin Conference on Scholarly Publishing TODAY!

🖊️ Moving beyond closed silos: liberating workflows based on open metadata to bring about an interoperable and open not-for-profit ecosystem for open access books and chapters
⏰ 12:00-12:10
🔗 https://buff.ly/3BX2Ahl

#Munin2025 #OpenMetadata #MetadataMatters #OpenAccess #OpenData

ICYMI: @crossref is launching the new Participation Reports, already available at https://www.crossref.org/members/prep/. Read about what's new and why more transparency on #openmetadata is important for movements like @BarcelonaDORI
, not to mention insights for members themselves https://doi.org/10.64000/8d5ga-2n897

Join @Thoth_metadata at #DCMI2025 all week

Today they're sharing a poster

📝 Open Access, Open Data, Open Archiving: Liberating Metadata Flows across the OA Books Landscape
📌 University of Barcelona, Spain
🔗 https://buff.ly/BjlTux1

#OpenMetadata #MetadataMatters #OABooks

👩🏻‍💻 Checking the Day 2 Wed. 23Oct programme at #OpenEngaged #OAWeek:
🌟Lightning talks: #SafeguardingResearch #Datarescue #CARE #OpenGLAM #Accessibility
🌟Technology, Power, and Equitable Design session. #LocalContexts #OpenCitations #OpenMetadata
✅ Register https://openscholarship.gitbook.io/open-and-engaged-2025/day-2-wednesday-22-october
Day 2💫 Wednesday, 22 October | Open and Engaged 2025

British Library's Open and Engaged 2025 Conference