The spatial join?: The end of the spatial join? Vikram Gundeti, CTO of #Foursquare, is reimagining #geospatial for #datascientists by eliminating traditional GIS hurdles and embracing #ML-friendly, agent-ready solutions. Will geodata be seamlessly accessible and...
https://spatialists.ch/posts/2025/08/06-the-spatial-join/ #GIS #GISchat #geospatial #SwissGIS
The spatial join? – Spatialists – geospatial news

The end of the spatial join? Vikram Gundeti, CTO of #Foursquare, is reimagining #geospatial for #datascientists by eliminating traditional GIS hurdles and embracing #ML-friendly, agent-ready solutions. Will geodata be seamlessly accessible and “mapless” by default?

Spatialists – geospatial news
How well do you think you know your data, #dataengineers and #datascientists ? You might want to profile your data more.
I've worked with the #Python package #ydata-profiling . It has some issues. But when I got it working, I found some surprising details about a dataset that I thought I already knew quite well. #pyspark
https://marcel-jan.eu/datablog/2025/04/24/profiling-data-with-ydata-in-pyspark/
Profiling data with ydata in PySpark | Expedition Data

Today at #WorkshopsForUkraine: #Devops for #DataScientists (#R #Rstats & #Python), by Rika Gorn (Posit), Thursday April 3rd, 6 pm CET. Register or sponsor a place for a student by donating to support #Ukraine. Details: https://sites.google.com/view/dariia-mykhailyshyna/main/r-workshops-for-ukraine#h.nk355htoamjv
Dariia Mykhailyshyna - Workshops for Ukraine

Feedback on the past workshops (if you want to learn how to make wordclouds, check out Text Data Analysis workshop below)

Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

Visualization: https://cran.r-project.org/web/packages/gggenes/vignettes/introduction-to-gggenes.html

Click this link for detailed information: https://statisticsglobe.com/online-course-data-visualization-ggplot2-r

#datastructure #datavisualization #dataanalytics #data #tidyverse #datascientists #ggplot2

Introduction to ‘gggenes’

Evaluating the normality of your data is crucial in statistical analysis, as many techniques assume that the data and/or residuals follow a normal distribution.

The visualization in the post contrasts two QQ plots: the left plot shows a data set following a normal distribution, where the points align closely with the reference line.

Check out this tutorial: https://statisticsglobe.com/r-qqplot-qqnorm-qqline-function

Click this link for detailed information: https://statisticsglobe.com/online-course-statistical-methods-r

#datascientists #datavisualization #data

Quantile-Quantile Plot in R | qqplot, qqnorm, qqline Functions & ggplot2

How to create a Quantile-Quantile plot in R - 4 example codes - qqplot, qqnorm & qqline functions of Base R vs. ggplot2 Package

Statistics Globe

The employees also warned that many of those enlisted by #ElonMusk to help him slash the size of the federal government under #Trump’s admin were political ideologues who did not have the necessary skills or experience for the task ahead of them.

The mass #resignation of #engineers, #DataScientists & #ProductManagers is a temporary setback for #Musk & the Republican president’s tech-driven #purge of the federal workforce.

#law #USpol #FederalAgencies

Discover how to implement hierarchical clustering in Python with our detailed tutorial. Perfect for #DataScientists and #ML engineers looking to master clustering algorithms. Includes code examples, visualizations, and practical applications. #Programming

https://teguhteja.id/hierarchical-clustering-python-step-by-step-implementation-guide/

Hierarchical Clustering in Python: A Step-by-Step Implementation Guide

Hierarchical clustering Python tutorial: Learn to implement clustering algorithms with step-by-step code examples and visualizations.

teguhteja.id

Principal Component Analysis (PCA) before Linear Regression can greatly enhance your data analysis process.

By incorporating PCA before performing linear regression, you can streamline your analysis pipeline and build more robust models that capture the essential relationships within your data.

I've developed an in-depth course on PCA theory and its application in R programming.

Further details: https://statisticsglobe.com/online-course-pca-theory-application-r

#pythontraining #datascientists #data #bigdata #advancedanalytics

Online Course: Principal Component Analysis (PCA) - Theory & Application in R

The Ultimate Course to Quickly Master Data Manipulation in R Using dplyr & the tidyverse - Instructor: Joachim Schork - Statistics Globe

Statistics Globe

The latest blog post in my productive data series for #DataScientists has landed, cleaning your data: https://jb.gg/legjhx

If you’re late to the party, we started with where to get data from: https://jb.gg/omw1ki

Then we explored that data: https://jb.gg/0i6mdr

And if you want to get stuck in straight away, check out 7 ways to use Jupyter Notebooks inside #Pycharm: https://jb.gg/9a1kui

Data Cleaning in Data Science | The PyCharm Blog

Real-world data needs cleaning before it can give us useful insights. Learn how how you can perform data cleaning in data science on your dataset.

The JetBrains Blog

Both R and Python are powerful tools widely used for data analysis and research, making them worth a detailed comparison.

Data credit: https://www.kaggle.com/

Learn more: https://statisticsglobe.com/online-course-r-introduction

#bigdata #rstudio #datascientists #datasciencecourse #package

Kaggle: Your Machine Learning and Data Science Community

Kaggle is the world’s largest data science community with powerful tools and resources to help you achieve your data science goals.