https://medium.com/@sanjay.mohindroo66/data-governance-best-practices-ensuring-data-quality-and-security-62cc1aae0f1f
Data lineage vergroot vertrouwen in overheidsdata
Overheden maken vaak gebruik van data om beleid te maken, dienstverlening te verbeteren en maatschappelijke vraagstukken aan te pakken. Maar hoe weet je of die data betrouwbaar is? Volgens een nieuw rapport van het Wetenschappelijk Onderzoek- en Documentatiecentrum (WODC) kan data lineage daarbij helpen.
Wat is data lineage?
Data lineage betekent letterlijk โafstamming van dataโ. Het gaat om het in kaart brengen van de volledige reis die data aflegt: van het moment dat het wordt verzameld (bijvoorbeeld via een formulier), tot aan de verwerking, bewerking en het uiteindelijke gebruik in bijvoorbeeld dashboards of rapportages. Met data lineage kun je nagaan:
Waarom is dit belangrijk voor de overheid?
Data lineage helpt om fouten vroegtijdig te signaleren, risicoโs in beeld te brengen en het vertrouwen in beleidsinformatie te vergroten, zowel binnen als buiten de organisatie. Het WODC benadrukt dat data lineage niet alleen een technisch hulpmiddel is, maar ook een stap richting professionalisering van datamanagement binnen de overheid.
Lees het nieuwsbericht van het WODC op hun website en bekijk het Engelstalige rapport.
Dit is een automatisch geplaatst bericht. Vragen of opmerkingen kun je richten aan @DigitaleOverheid@social.overheid.nl
๐บ๏ธ ๐๐ก๐๐ญ ๐๐จ๐ฎ ๐๐ก๐จ๐ฎ๐ฅ๐ ๐๐ง๐จ๐ฐ ๐๐๐๐จ๐ซ๐ ๐๐ฆ๐ฉ๐ฅ๐๐ฆ๐๐ง๐ญ๐ข๐ง๐ ๐ ๐๐๐ญ๐ ๐๐๐ญ๐๐ฅ๐จ๐ . Implementing a data catalog is a necessity if you want to leverage your data. While the allure of cutting-edge technology is strong, the success hinges on a solid foundation of non-technical considerations.
๐ Read our guide & explore what you need to know to avoid common pitfalls and ensure success.
https://www.datalumen.eu/should_know_before_implementing_datacatalog/
#DataCatalog #DataGovernance #DataManagement #DataLineage #MetaDataManagement #DataAgenda #DataStrategy
"AI is all about data. Reams and reams of data are needed to train algorithms to do what we want, and what goes into the AI models determines what comes out. But hereโs the problem: AI developers and researchers donโt really know much about the sources of the data they are using. AIโs data collection practices are immature compared with the sophistication of AI model development. Massive data sets often lack clear information about what is in them and where it came from.
The Data Provenance Initiative, a group of over 50 researchers from both academia and industry, wanted to fix that. They wanted to know, very simply: Where does the data to build AI come from? They audited nearly 4,000 public data sets spanning over 600 languages, 67 countries, and three decades. The data came from 800 unique sources and nearly 700 organizations.
Their findings, shared exclusively with MIT Technology Review, show a worrying trend: AI's data practices risk concentrating power overwhelmingly in the hands of a few dominant technology companies."
https://www.technologyreview.com/2024/12/18/1108796/this-is-where-the-data-to-build-ai-comes-from/
From Chaos to Clarity? ๐Find out how you can make data lineage simple. Data moving through complex architectures doesnโt have to be a mystery. ๐ Check out our latest blog to learn how OpenLineage brings order to your data stack!
๐ Read more to be informed:
https://www.datalumen.eu/openlineage/
#OpenLineage #DataLineage #DataCompliancy #DataGovernance #DataPipelineMonitoring #MetadataManagement
๐๐ง๐๐๐ซ๐ฌ๐ญ๐๐ง๐๐ข๐ง๐ ๐ญ๐ก๐ ๐๐ฉ๐๐๐ญ๐ซ๐ฎ๐ฆ ๐จ๐ ๐๐๐ญ๐ ๐๐ข๐ง๐๐๐ ๐ ๐๐ง๐๐ฅ๐ฒ๐ฌ๐ข๐ฌ
#Datalineage analysis is the backbone of #datagovernance, its the journey of data from origin to consumption. It not only ensures #dataintegrity & #compliance but also aids in decision-making processes & enhances data-driven strategies. Within the realm of data lineage analysis, various methodologies & approaches exist, each tailored to specific needs & objectives: https://www.foxconsulting.co/post/understanding-the-spectrum-of-data-lineage-analysis
"[#DataAnalysts]..should know how the data was born, with all details of measurement... Few things have more devastating consequences ... than someone in the audience pointing out...measurement issues the analyst didn't consider." Bรฉkรฉs and Kรฉzdi, 2021: Data Analysis for Business, Economics, and Policy
If you're having trouble helping your org understand the value of #datalineage and #metadata, share this with them and ask if they know how all the data they're using was gathered and measured.
I wrote about the Lineage Diff for dbt projects feature of PipeRider:
You can compare then lineage DAG from both and after making code changes in dbt. It's really useful for debugging issues/seeing impact etc:
https://medium.com/inthepipeline/dbt-data-lineage-diff-impact-analysis-visualized-bec9927b0c4e
#DataOps #DataLineage #DataViz #DataQuality #DataTesting #DataEngineering