@RejoinEU I have worked in data quality for many years. Much of what needs to be done is 'grunt work': technology cannot assess the accuracy of data. Also, transactional data records something that happened at a point in time; unless there is an independent record of that transaction, it is extremely difficult to improve the quality of its data. #dataquality
Data Engineer - Remote

Automate data workflows; Build data pipelines; Collaborate with AI researchers; Collaborate with data scientists; Design data pipelines; Design data schemas; Develop data models; Develop storage systems; Ensure data reliability; Ensure data integrity; Ensure data quality; Explore datasets; Extract, transform, and analyze data; Implement data monitoring; Implement data validation; Ingest data from multiple sources; Maintain data pipelines; Prepare datasets for experimentation; Prepare datasets for model training; Process data; Transform data into structured formats; Write Python scripts; Write SQL queries


Scherlok – zero-config data quality monitoring, works with dbt

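Scherlok is a zero-config data quality monitoring tool that integrates with dbt. It learns the normal patterns of your data and detects anomalies automatically, with no rules written in advance. It supports PostgreSQL, BigQuery, and Snowflake, and it integrates easily into CI/CD pipelines, where it can block a deployment when a critical anomaly occurs. It connects to notification channels such as Slack, Discord, and Teams, and visualizes anomaly history and schema changes in an HTML dashboard. Compared with existing data quality tools, it greatly reduces the setup and maintenance burden, which improves data engineering productivity.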
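To make "learning the normal pattern" concrete, here is a minimal sketch of one common approach: summarise past values of a metric and flag new values that fall far outside that baseline. This is not Scherlok's actual implementation; the function names, the z-score technique, the threshold of 3, and the sample row counts are all assumptions chosen for illustration.

```python
from statistics import mean, stdev


def learn_baseline(history):
    """Summarise past observations of a metric, e.g. daily row counts."""
    return {"mean": mean(history), "stdev": stdev(history)}


def is_anomalous(value, baseline, threshold=3.0):
    """Flag a value more than `threshold` standard deviations from the mean."""
    if baseline["stdev"] == 0:
        # A perfectly constant history: any change at all is unusual.
        return value != baseline["mean"]
    z_score = abs(value - baseline["mean"]) / baseline["stdev"]
    return z_score > threshold


# Hypothetical daily row counts for one table (illustrative data only).
row_counts = [1000, 1020, 990, 1010, 1005, 995, 1015]
baseline = learn_baseline(row_counts)

sudden_drop = is_anomalous(400, baseline)   # far below the usual range
normal_day = is_anomalous(1008, baseline)   # within the usual range
```

In a CI job, a check like this could end with a non-zero exit code when an anomaly is found, which is one way a pipeline step can block a deployment.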

https://github.com/rbmuller/scherlok

#dataquality #dbt #anomalydetection #cicd #monitoring

GitHub - rbmuller/scherlok: A detective for your data. Zero-config data quality monitoring — works with dbt, Postgres, BigQuery, Snowflake. No YAML.


Mistaking Quantity for Quality in Tech and Life - Tech Field Day Podcast
@TechFieldDay @TechFieldDayPod @SFoskett @GuyCurriersFeed @DaveGraham #TFDPodcast #AIFD8 #AI #AgenticAI #AIInfrastructure #AIAgents #AIQuality #DataQuality

https://youtu.be/9CAVQPJTGzM


Now that AI has enabled us to have an unlimited amount of content, generated on demand and instantly, we find ourselves questioning the quality of the output. 🤖 🎙️

🎙️ This episode of the Tech Field Day Podcast, recorded prior to AI Field Day by delegates Barbara Roos, Guy Currier, Dave Graham, and Stephen Foskett, considers this common trade-off between quantity and quality.

#TFDPodcast #AIFD8 #AI #AgenticAI #AIInfrastructure #AIAgents #AIQuality #DataQuality

https://youtu.be/9CAVQPJTGzM

88% of French companies are rushing into AI without reliable data

Only 12% of French companies have AI-ready data. Diagnosis, costs, and an action plan so you do not waste your budget.

https://www.decodeur-ia.com/articles/donnees-ia-entreprise-guide-preparer-data-quality-avant-deployer-2026/

#IA #IntelligenceArtificielle #dataquality #donnesia #auditdonnes #iaentreprise

Data and AI: preparing your data before you deploy

12% of French companies have AI-ready data. Audit, quality, governance: the practical guide for SMEs before launching an AI project in 2026.

Décodeur IA

This week we discussed the main challenges of machine learning in the #KDAI2026 lecture. It should be obvious that "bad data quality leads to bad results" :)
However, we also talked about insufficient amounts of data, non-representative data, irrelevant features, overfitting, and various forms of bias.

@fiz_karlsruhe #AI #machinelearning #unicorn #dataquality #lecture #datascience

Data quality is the foundation of productive AI automation. Only with clean runtime truth do AI systems reach their full potential. Do not ignore data pollution: it undermines decision-making and reduces ROI. Invest in data cleansing to optimize your automations. #AI #DataQuality #DigitalTransformation #ITConsulting

LintedData, which we recently released, is a linter for RDF and ontologies designed for easy use in CI pipelines. It checks for common violations of best practices in ontology engineering.
GitLab: https://gitlab.com/dlr-dw/linteddata/
Docker: https://hub.docker.com/r/dlrdw/linteddata/
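
As a rough illustration of what a best-practice check on an ontology can look like, the sketch below flags classes that have no human-readable label. It is not LintedData's code, and whether LintedData includes this particular rule is an assumption; the function name and the example.org IRIs are hypothetical. Triples are modelled as plain tuples so the example needs no RDF library.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_CLASS = "http://www.w3.org/2002/07/owl#Class"


def classes_without_label(triples):
    """Return the sorted IRIs of owl:Class subjects that have no rdfs:label."""
    classes = {s for s, p, o in triples if p == RDF_TYPE and o == OWL_CLASS}
    labelled = {s for s, p, o in triples if p == RDFS_LABEL}
    return sorted(classes - labelled)


# Hypothetical ontology fragment as (subject, predicate, object) tuples.
triples = [
    ("http://example.org/Sensor", RDF_TYPE, OWL_CLASS),
    ("http://example.org/Sensor", RDFS_LABEL, "Sensor"),
    ("http://example.org/Platform", RDF_TYPE, OWL_CLASS),  # no label
]

violations = classes_without_label(triples)
```

A CI step could fail the build whenever `violations` is non-empty, which is the general pattern a linter in a pipeline follows.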

Today I am presenting LintedData at the demo session of the Helmholtz Metadata Conference 2026.
Abstract & Poster: https://elib.dlr.de/223803/

#RDF #Ontologies #KnowledgeGraphs #DataQuality #OntologyQuality #OntologyEngineering #HMC2026 @helmholtz_hmc