Apache Beam with GCP Dataflow: The Ultimate Duo for Data Integration

Learn the power of Apache Beam with GCP Dataflow for seamless, scalable data integration solutions in real-time environments.

XTIVIA

Building data pipelines with Java and open source

https://videos.ijug.eu/w/vengBvKMcaP8NG8xhTnVTU

PeerTube

I'm not feeling the warm fuzzies about trying to do GCP Dataflow (aka Apache Beam) with TypeScript. Doesn't feel fully baked.

Also doesn't feel like there are a lot of docs/examples or many people using it. I don't think our org should be on the bleeding edge of something like this for a core piece of our infrastructure.

Dunno. Will keep working on it for a bit.

#GCP #GoogleCloud #Dataflow #ApacheBeam

NEW VIDEO: Transcribe Newscasts in Parallel with #ApacheBeam and @huggingface
https://youtu.be/_bKyFREDvZc
AI News Transcriptions in Parallel with Apache Beam and Hugging Face

YouTube
Groovy Whiskey

Do you have a penchant for fine whiskey? This presentation embarks on a quest to analyze whiskeys produced by the world’s top 86 distilleries and identify the perfect single-malt Scotch. For fun, several Apache projects will be used. Groovy simplifies your data science code. Commons Math and Commons CSV help you write the code for reading your data and for your processing logic. Beam, Flink, Ignite, Spark and Wayang let you scale your machine learning applications.

Speaker Deck
Let's try: Apache Beam part 8 - Tags & Side inputs

We sometimes have to apply complex conditions in our Beam pipeline. In this blog, we will see together how we can design those complex ideas into a simple, readable yet powerful workflow.

bluebirz.net
Let's try: Apache Beam part 7 - custom IO

In the real world, there are cases where we need to connect to sources that Apache Beam doesn't have IO packages for. Let's see how we can implement an IO package in our own style.

bluebirz.net

#CaseStudy - Discover how #Yelp reworked its data streaming architecture with #ApacheBeam & #ApacheFlink!

The company replaced a fragmented set of data pipelines for streaming transactional data into its analytical systems, such as Amazon Redshift and its in-house data lake, using Apache data streaming projects to create a unified and flexible solution.

Dive into the details: https://bit.ly/3WgkTL7

#InfoQ #SoftwareArchitecture #EventDrivenArchitecture #DataPipelines #Streaming

Yelp Overhauls Its Streaming Architecture with Apache Beam and Apache Flink

InfoQ
#bluebirzblog
Let's try: Apache Beam part 6 - instant IO
#ApacheBeam provides ready-made inputs and outputs for PCollections in many packages. We just import them, call them properly, and get the job done.
[Medium] https://medium.com/@bluebirz/lets-try-apache-beam-part-6-instant-io-fae7f79b1801
[TH] https://www.bluebirz.net/th/lets-try-apache-beam-part-6-th/
[EN] https://www.bluebirz.net/en/lets-try-apache-beam-part-6/

Good morning ☕☀️

I am #recruiting a Data Engineer into my team.

We're building pipelines at #trivago that analyse, verify and normalise the content coming from partners, to deliver it clean to the rest of the company.

I am looking for a pragmatic engineer committed to quality and stability, who takes on the challenge of processing big volumes of data. At the moment we use #Python and #ApacheBeam in #GCP, but you don't need them to apply if you are using other data tooling.

We offer a competitive salary, a constant challenge, a very enthusiastic team and an authentic atmosphere of multicultural colleagues.

We're based in #Düsseldorf, and we work in English in a hybrid scheme of 2 days #homeOffice / 3 days in person. Also unlimited vacation days, 20 days per year fully remote, kitchen, coffee, daily fruit...

Apply now!
https://careers.trivago.com/job/r7193037002/