I'll try to post each day a dataviz related to #Wikidata or #Wikipedia until Christmas.

#theadventofwikiviz

The film director Jean-Luc Godard has passed away last September.

Discover his filmography with this dataviz based on #Wikidata 👉 https://observablehq.com/@pac02/tribute-to-jlg

Tribute to JLG

A beeswarm plot of Godard's filmography Jean-Luc Godard has passed away on September 13, 2022. He was one of the greatest film director of all times. This dataviz explore his whole filmography using Wikidata. The data are retrieved from Wikidata using a <a href="https://query.wikidata.org/# ">SPARQL query</a>. We find movies directed by Jean-Luc Godard in Wikidata. It shows all movies by year of release with the number of wikimedia sitelinks. The number of sitelinks measure the number of pages or articles

Observable

Annie Ernaux got the Nobel prize in literature. Have a look at her career in one chart :
https://observablehq.com/@pac02/tribute-to-annie-ernaux

#Wikidata #nobelprize #AnnieErnaux

Tribute to Annie Ernaux

French writer Annie Ernaux is Nobel laureate in 2022. This dataviz explore Annie Ernaux's work using Wikidata. The data are retrieved from Wikidata using a <a href="https://query.wikidata.org/# ">SPARQL query</a>. I find works written by Annie Ernaux in Wikidata. It shows all novels by year of release with the number of wikimedia sitelinks. The number of sitelinks measure the number of pages or articles dedicated to a topic on various Wikimedia projects such as Wikipedia in different languages. Beeswarm p

Observable

Annie Ernaux got the Nobel prize. But how many women got the Nobel?

Look at all Nobel prize laureates by gender and by year :
https://observablehq.com/@pac02/all-nobel-laureates-by-gender-and-by-year

All Nobel laureates by gender and by year

In this notebook, I look at all Nobel laureates from the beginnings to today by gender, number of sitelinks and year. The data are retrieved from Wikidata using a <a href="https://query.wikidata.org/# ">SPARQL query</a>. I find Nobel laureates in the database. The number of sitelinks measure the number of pages or articles dedicated to a topic on various Wikimedia projects such as Wikipedia in different languages. All Nobel together Each dot is a Nobel laureate. The size is proportional to number of sitel

Observable

Look at the career of Nobel laureates in literature? when they got the Nobel? did they publish most of their work before the Nobel? did they still publish after the Nobel?

Explore 3,396 works published by 120 laureates in one dot chart !

https://observablehq.com/@pac02/career-of-nobel-laureates-in-literature

#Wikidata #Observablehq #nobelprize #ObservablePlot

Career of Nobel laureates in literature

In this notebook, I look at the career of all Nobel laureates in literature. The data are retrieved from Wikidata using a <a href="https://query.wikidata.org/# ">SPARQL query</a>. I find works written by Nobel laureates in the database. All works published by all Nobel prize in literature (according to Wikidata) The chart shows all works published by Nobel laureates by age of the author and the age in their career when they got the Nobel prize. The visualization is still a bit messy with some overlap. I

Observable

Last wikiviz about the Nobel prize.

Browse the career of every Nobel prize in economics :

https://observablehq.com/@pac02/career-of-nobel-laureates-in-economics

#econtwitter

Career of Nobel laureates in economics

In this notebook, I look at the career of all Nobel laureates in economics. The data are retrieved from Wikidata using a <a href="https://query.wikidata.org/# ">SPARQL query</a>. I find works written by Nobel laureates in the database. All works published by all Nobel prize in economics (according to Wikidata) The chart shows all works published by Nobel laureates by age of the author and the age in their career when they got the Nobel prize. The visualization is still a bit messy with some overlap. I h

Observable
Tour de France's history at a glance

This dataviz shows all stage winners of the Tour de France by country of citizenship. It tells a story of the Tour de France since the first occurrence in 1903. The data comes from Wikidata. They are collected using a SPARQL query and imported directly into Observable. There may be some problems in the data if there are several stages on the same day. If you meet a `Runtime error`, just try reloading the page (CTRL + R) and be patient. All stages from 1903 to today I use a cell plot to show all stages from

Observable
Tour de France Femmes

This dataviz shows all stage winners of the Tour de France Femmes by country of citizenship. It is a simple fork from @pac02/tour-de-frances-history-at-a-glance?collection=@pac02/wikidata. The data comes from Wikidata. They are collected using a SPARQL query and imported directly into Observable. There may be some problems in the data if there are several stages on the same day. If you meet a `Runtime error`, just try reloading the page (CTRL + R) and be patient. All stages I use a cell plot to show all sta

Observable

Now we can look at the history of the Vuelta (Spain's tour) in one chart.

Colors have been replaced by flags.

https://observablehq.com/@pac02/all-vueltas-stage-winners-by-country-of-citizenship

#Vuelta #wikiviz #Observablehq #ObservablePlot #Wikidata

All Vuelta's stage winners by country of citizenship

The data comes from Wikidata. They are collected using a SPARQL query and imported directly into Observable. There may be some problems in the data if there are several stages on the same day. If you meet a `Runtime error`, just try reloading the page (CTRL + R) and be patient. I focus on vueltas after 2011 since the data before 2011 are really incomplete. Number of stage wins by country Details Under the hood Libraries

Observable

What is your probability of having an article in #frwiki ?

It depends on your birthplace.

Evidence from France => https://observablehq.com/@pac02/births-department-wikipedia?collection=@pac02/wikipedia

#wikipedia #wikidata #spatialinequalities #Observablehq #observablePlot

Does your birthplace affect your probability to have your Wikipedia biography ? some evidence from people born in France.

How does your birthplace affect your probability of being on Wikipedia ? Having a Wikipedia page is a sign of notoriety. We know that the probability of being successful depends on your birthplace. So in this notebook I look at the probability of having a Wikipedia page depending on your birthplace. While browsing Wikipedia in French, I was surprised by the number of people born in Neuilly-sur-Seine (Hauts-de-Seine) or in Paris. So I wanted to know if there was an overrepresentation of people born in these

Observable

The probability of having a Wikipedia page, displayed on a map.

https://observablehq.com/@pac02/visualizing-the-probability-of-having-a-wikipedia-page-usi

Visualizing the probability of having a Wikipedia page using a Dorling cartogram.

This is a complementary visualization to the Does your birthplace affect your probability to have your Wikipedia biography ? notebook. We use Bertin.js to draw a Dorling cartogram. Bubble size is proportional to the number of births in each department while the filling color of bubbles is proportional to the probability of having a Wikipedia page. We focus on people born in metropolitan France between 1975 and 1990. Unfortunately, data for the overseas departments are not available. Appendix

Observable

Do actresses become singers and singers become actors? Evidence from France using #Wikidata

https://observablehq.com/@pac02/actress-singers-and-actor-singers-do-actresses-become-sing

Actress-singers and actor-singers: do actresses become singers and singers become actors?

Jeanne Moreau and Anna Karina were first known as actresses and then became singers. We have also many examples of singers becoming actors. For instance Eddy Mitchell, Johnny Hallyday, Orelsan or Philippe Katerine were at first singers and then became actors. So my intuition is that the actress-singers tend to be actresses at first and then become singers whereas male singers tend to have opportunities to become actors later in their career. Now, I need to find data to test my assumption. I collect data fro

Observable

Wikipedia in French has now more than 2,000 featured articles!

Today's notebook explores those 2,000 articles using #Wikidata.

https://observablehq.com/@pac02/celebrating-the-2-000-featured-articles-milestone-in-wikip

#frwiki #Wikidata #wikiviz #Observablehq

Celebrating the 2,000 featured articles milestone in Wikipedia in French

Wikipedia in French has now 2,000 featured articles ("Articles de qualité"). It's time to celebrate this milestone and to look at those 2,000 articles using Wikidata. I use the Wikipedia Categorymembers API through a SPARQL query to get all articles featured in category "Article de qualité". Let's have a look at the data ! Instance of (P31) : a majority of humans I only show the top 20 occurrences. Country (P17) : a majority of articles about France Note : I use the fantastic Bertin.js library to draw a Dor

Observable

The same analysis as yesterday but for good articles in #frwiki

Good articles are similar to featured articles but criteria are just a bit easier to achieve.

Distribution of articles by instance of (#P31), country (P17) , gender (P21) , country of citizenship (P27) and occupation (P106)

https://observablehq.com/@pac02/good-articles-in-wikipedia-in-french

#wikiviz #Observablehq #Wikipedia #Wikidata

Good articles in Wikipedia in French.

I explore the list of good articles in Wikipedia in French. See https://fr.wikipedia.org/wiki/Wikip%C3%A9dia:Bons_articles if you want to know more about the selection process of good articles. Instance of (P31) : a majority of humans I only show the top 20 occurrences. Country (P17) : a majority of articles about France Note : I use the fantastic Bertin.js library to draw a Dorling cartogram. Focus on humans Gender : only 16% of women Occupation (P106) : Writers, politicians and actors come first ! Country

Observable

Today's wikiviz is a short introduction to #Observablehq for #Wikidata users.

https://observablehq.com/@pac02/an-introduction-to-observable-for-wikidata-users

It shows how to go from a #SPARQL query to a dataviz in Observable.

An introduction to Observable for Wikidata users

Wikidata is an open and contributive knowledge base. Query.wikidata.org is a tool to query Wikidata using SPARQL queries. Query.wikidata.org is a great tool. However it has its own limitations. For instance, it is not possible to combine several queries to write a story with the data. It is limited to computations allowed by SPARQL and it is not possible to customize the visualisation. Observable is a platform allowing to write notebooks in Javascript. Like the Jupyter or Rmarkdown notebooks often used in d

Observable

Writing #SPARQL queries can be very challenging. Maybe you can find inspiration by browsing and searching in this dataset of 2,400 queries collected from #Wikidata.

https://observablehq.com/@pac02/hello-sparql-queries-dataset

Hello SPARQL queries dataset

Querying Wikidata.org using query.wikidata.org is a hard job. A lot of training is needed to write queries. Wikidata contributors share queries in wiki pages but those wiki pages are hard to search. The data collection notebook extracts all links to SPARQL queries from a selection of wiki pages and produces a dataset with ** queries** ! This notebook provides a small search engine to browse all those queries. Of course this is useful for people with some knowledge in SPARQL. The SPARQL Wikibook is a very

Observable

Today's #wikiviz analyzes your list of created articles in #Wikipedia through the lens of #Wikidata : https://observablehq.com/@pac02/look-at-your-list-of-created-articles-through-wikidata

The tool provides insights about the countries, gender, instance of, country of citizenship and occupation of the articles you've created.

WICA: Wikidata's Insights for your Created Articles

This tool helps explore the list of created articles of a Wikipedia user through the lens of Wikidata. It retrieves the list of created articles using xtools API, get the corresponding Wikidata items and computes statistics based on the following properties : Country (P17) Gender (P21) Country of citizenship (P27) Instance of (P31) Occupation (P106) Field of work (P101) Native language (P103) Languages spoken, written or signed (P1412) Writing language (P6886) Religion or worldview (P140) Sexual orientation

Observable

Today's #wikiviz measures gender diversity in #Wikipedia articles :
https://observablehq.com/@pac02/explore-gender-diversity-in-a-single-wikipedia-article

Play with it. You'll see. It's really difficult to find articles with a high percentage of women !

#genderbias #genderdiversity #WomenInRed #LessanspagEs

Gender diversity explorer

Explore gender diversity in a single Wikipedia article This notebook explores gendered entities cited in a Wikipedia articles. See this notebook for the context and the methodology and the Wikidata page of the global project. If you want to focus on people born in a recent history, you can use @pac02/gender-diversity-inspector, which introduces a filter on birth date. News: June 8, 2023: The page has been updated. We now use SearchForm to search Wikipedia articles with a suggestion engine using a trick disc

Observable
Gender diversity in Wikipedia articles: evidence from some selected academic disciplines in the English Wikipedia

Everyone knows humaniki, a fantastic tool which computes the share of women and men among all biographies in Wikipedia. It shows that "only" 19% of biographies are about women in Wikipedia in English (in december 2021). However this global statistic is only one aspect of gender bias in Wikipedia. Another way to look at gender bias would be to ask if women and non binary people are cited in Wikipedia articles about general topics. For instance, take the article Economics. Economics is an academic discipline.

Observable

Of course we expect that if an article links to many people from the past, #genderdiversity will be lower.

Let's just add a small filter to correct this bias : https://observablehq.com/@pac02/gender-diversity-inspector

Gender diversity inspector

This notebook explores gendered entities cited in a Wikipedia articles. It improves @pac02/explore-gender-diversity-in-a-single-wikipedia-article by filtering people on the birthdate. @pac02/explore-gender-diversity-in-a-single-wikipedia-article computes a global share of people cited in an article by gender. This provides useful statistics. However, it makes sense to restrict the results to people born in a recent history. Therefore we introduce a filter on birthdate of people. See this notebook for the c

Observable

Do the article in French and the article in English about economics links to the same people or completely different ones?

Use the comparator tool to compare quickly people named in two #Wikipedia articles :
https://observablehq.com/@pac02/comparator-compare-named-entities-cited-in-two-wikipedia-a

Again, the notebook combines #Wikidata #Observablehq #ObservablePlot and #SPARQL magic.

Comparator : compare named entities cited in two Wikipedia articles

This tool compares human entities cited in two Wikipedia articles. It get the list of entities cited (blue links), filter humans using Wikidata from both articles. Then it create three sets : Intersection : entities cited in both articles Entities cited only in article 1 Entities cited only in article 2 See also Explore gender diversity in a single Wikipedia article to explore gender diversity in entities cited in a Wikipedia article. Citizenship diversity in a Wikipedia article Summary Among the entities

Observable

Wiki articles should reflect the knowledge of all over the world. So let's look at citizenship diversity of people named in an article :

https://observablehq.com/@pac02/citizenship-diversity-in-a-wikipedia-article

It's funny to compare citizenship diversity on different Wikipedia articles about the same topic.

Citizenship diversity in a Wikipedia article

This notebook explores gendered entities cited in a Wikipedia articles. See this notebook for the context and the methodology and the Wikidata page of the global project. Inputs Choose a wikiprojet and an article. See the article on Wikipedia : Share the url : https://observablehq.com/@pac02/citizenship-diversity-in-a-wikipedia-article?wikipedia= &article= Count of entities by country of citizenship Concentration index Concentration index is . It is equal to 1 if there is full concentration. Detail of enti

Observable

Yet another tool to explore entities linked in a #Wikipedia article.

The wikilinks inspector looks at entities by country (P17), by gender (P21), country of citizenship (P27), instance of (P31) and occupation (P106).

Here is an example with the article in English about the European Union : https://observablehq.com/@pac02/articles-wikilinks-inspector?wikipedia=en.wikipedia.org&article=European%20Union&claim=P17&lang=en

#enwiki #SPARQL #wikiviz #Wikidata #Observablehq #BubbleChart

Article's wikilinks inspector

This tool takes all entities named in a Wikipedia article (ie blue links or wikilinks) and compute insights about those entities using Wikidata. You can choose to look at the nature of entities (P31), the gender (P21), the country of citizenship (P27), the country (P17) or the occupation (P106). It draw upon @pac02/look-at-your-list-of-created-articles-through-wikidata and @pac02/explore-gender-diversity-in-a-single-wikipedia-article. Feedback is welcome. Inputs Share the url : https://observablehq.com/@pac

Observable

#genderdiversity again. Let's look at the longitudinal dataset collected during one year by @OpenSexism

https://observablehq.com/@pac02/wednesday-index

The Wednesday Index

A longitudinal analysis of gender diversity in Wikipedia articles In 2021, I released a tool which measures gender diversity in Wikipedia articles (@pac02/explore-gender-diversity-in-a-single-wikipedia-article). OpenSexism uses this tool to measure gender diversity in a selection of articles from Wikipedia in English each wednesday. This is the Wednesday index. This approach provides a longitudinal dataset to study gender diversity over time in this panel of Wikipedia articles. OpenSexism has shared the dat

Observable

@OpenSexism Last #wikiviz and end of this thread.

#Wikidata use labels for items. Sometimes those labels are the male form, sometimes they are the female form and sometimes they are gender neutral.

Check the results in your language : https://observablehq.com/@pac02/gendered-labels-in-wikidata

This is another form of #genderbias in my opinion.

Are occupation labels in Wikidata gender neutral?

This notebook looks at labels of class occupation and counts the number of labels which take the male form as a generic form, the female form as a generic or take a more gender neutral form. I consider two forms of gender neutral labels : labels which are different from the female and male form and labels which are the same as female and male form. I use P2521 for female form and P3321 for male form. labels using male form as generic labels using female form as generic form labels which are the same as mal

Observable

@pac2,

Do you have variants of your #Observablehq notebook demos where the #Wikidata ID is surfaced?

Why is this useful?
It enables one lookup additional information about what an ID denotes.

Here's an example from the not too distant past using #DBpedia ID.

https://observablehq.com/@danielhmills/dbpedia-data-demo

#LinkedData #KnowledgeGraph #SPARQL #LODCloud

DBpedia Data Demo

Data Table

Observable
@kidehen What do you mean by surfaced?

@pac2,

Making #Wikidata IDs visible in the query result.

@kidehen oh yes I could do that.

@pac2 Ugh: "Focus on humans
Gender : only 16% of women"

It would also be interesting to see the gender diversity of links on these 2,000 featured pages.

@OpenSexism yes but in my opinion the gender diversity index is only relevant if the topic is general. I think it's not relevant for a biography or a very narrow topic.