This week we were discussing the main challenges of Machine Learning in the #KDAI2026 lecture. It should be very obvious that "bad data quality leads to bad results" :)
However, we were also talking about insufficient number of data, non-representative data, irrelevant features, overfitting and various forms of bias.

@fiz_karlsruhe #AI #machinelearning #unicorn #dataquality #lecture #datascience

Datenqualität ist die Grundlage für produktive AI-Automation. Nur mit sauberer Runtime-Wahrheit erreichen KI-Systeme ihre volle Leistungsfähigkeit. Ignorieren Sie Datenverschmutzung – sie untergräbt die Entscheidungsfindung und reduziert den ROI. Investieren Sie in Datenreinigung, um Ihre Automatisierungen zu optimieren. #AI #DataQuality #DigitalTransformation #ITConsulting

LintedData is a linter for RDF and Ontologies for easy use in CI pipelines, we recently released. It checks for common violations of best practices in ontology engineering.
GitLab: https://gitlab.com/dlr-dw/linteddata/
Docker: https://hub.docker.com/r/dlrdw/linteddata/

Today I present LintedData at the Helmholtz Metadata Conference 2026 demo session.
Abstract & Poster: https://elib.dlr.de/223803/

#RDF #Ontologies #KnowledgeGraphs #DataQuality #OntologyQuality #OntologyEngineering #HMC2026 @helmholtz_hmc

#GESISblog #blog #KODAQS #DataQuality #DBD #DigitalBehavioralData
New on the GESIS Blog: Part 2 of our blog series on the KODQAS Toolbox: Digital Behavioral Data

In the first blog post of the KODAQS Toolbox series, we discussed how data quality issues can affect survey data. Similar challenges arise in digital behavioral data (DBD), though they often manifest differently.

Why did people stop responding to federal economic surveys?
https://www.brookings.edu/articles/why-did-people-stop-responding-to-federal-economic-surveys-what-can-be-done/
Declining response rates challenge the precision and bias of economic indicators like unemployment. Surveys remain vital for capturing nuances, such as job-seeking intent, that administrative data cannot track.
Strong data stewardship and reduced respondent burden are necessary to sustain the statistical system.
#surveymethodology #economics #statistics #dataquality #nonresponsebias

What Every NED Needs to Know About Data Governance

What your board is not asking about data governance Hi Most NEDs will oversee a data governance failure before they oversee a data governance success. Not because they lack the capability. Because nobody told them what to look for. I've written a report specifically for non-executive directors and board advisors (And a CDO) who want to get ahead of this — before it lands on the agenda as a breach, a regulatory inquiry, or a strategic decision made on numbers nobody can vouch for. What Every NED Needs to Know About Data Governance Before Their Next Board Meeting covers: — The five things you actually need to understand (without becoming a data expert) — The warning signs most NEDs miss until it's too late — Twelve questions boards are not asking — including the five most likely to expose systemic failure — A Board Data Governance Readiness Scorecard you can use immediately — A practical 90-day framework for your next Board meeting The average cost of a data breach is now $4.88 million. The ICO's maximum fine is up to 4% of global turnover. Sixty-two percent of organisations say data governance is their single biggest barrier to AI adoption. These are not hypothetical risks. They are board-level accountability questions. If you'd like a word document copy of the report, or want to discuss what good data governance oversight looks like for your board, reply to this email. Liz Henderson The Data Queen Non-Executive Director & Board Advisor in Digital, Data and AI lizhendersondata.wordpress.com | linkedin.com/in/dataqueenliz

https://lizhendersondata.wordpress.com/2026/04/23/what-every-ned-needs-to-know-about-data-governance/

Accurate CRM Data Through Data Enrichment Company Services

Incomplete data reduces the effectiveness of CRM and analytics. Data cleansing removes errors and standardizes formats. A data enrichment company enhances records with validated business information, helping teams improve segmentation, reporting, and customer engagement strategies.

Know more: https://www.hitechdigital.com/data-cleansing-and-enrichment-services

#DataCleansingServices #DataEnrichment #CRMDataCleansing #DataQuality #B2BData #DataManagement #DataDriven

Top 5 common issues in property listings and how to solve them.

Accurate, complete, and consistent data builds buyer trust. Regular updates, clear pricing, and standardized formats improve listing quality. High-quality images, validation, and automation help remove outdated or duplicate listings and boost engagement.

Explore more: https://www.habiledata.com/blog/5-common-issues-observed-in-property-listings-and-solutions/

#RealEstate #PropertyListings #DataQuality #PropTech

Poor data quality can cost organisations millions annually, impacting decision-making and revenue. Advanced tools and human-in-the-loop strategies are essential for effective detection and repair, turning data quality into a strategic advantage.
Discover more at https://dev.to/rawveg/the-real-cost-of-bad-data-8fe
#DataQuality #AIinData #HumanInTheLoop
The Real Cost of Bad Data

Somewhere in a data warehouse, a customer record sits incomplete. A postcode field contains only the...

DEV Community