Jason Baldridge

1.4K Followers
181 Following
280 Posts
Research Scientist working on language and vision at Google Research.

## Summary of Strengths of the Paper

It is short.

## Summary of Weaknesses

It is not short enough.

#reviewing #reviewer2

Applications for unrestricted funds through the Google Research Scholar program are now open ! https://research.google/outreach/research-scholar-program/
Research Scholar Program – Google Research

The Research Scholar Program aims to support early-career professors who are pursuing research in fields relevant to Google.

Google Research
* equal first authors
+ joint front middle authors
^ equivalent middle middle authors
• totally cool back middle authors
# equally senior senior authors

AI images are getting harder to spot. Google thinks it has a solution. #giftArticle https://wapo.st/3szeZYM

A few months ago the WEF put out surprisingly good recs about #generativeAI. But that leaned a lot on the idea of watermarking sources of data. Now Google has announced the opposite, that it can watermark its own AI outputs. While nice (& no doubt hard) this is nowhere near as important as tracing the data inputs for ensuring not only IP payments, but for issues such as #trustworthiness.

AI images are getting harder to spot. Google thinks it has a solution.

The tech giant unveiled a new watermark for AI-generated images, aiming to curb the spread of misinformation during the 2024 presidential campaign.

The Washington Post

title text: The vaccine stuff seems pretty simple. But if you take a closer look at the data, it's still simple, but bigger. And slightly blurry. Might need reading glasses.

(https://xkcd.com/2806)
(https://www.explainxkcd.com/wiki/index.php/2806)

Anti-Vaxxers

xkcd

DISCARDED.

toXic is live now. rip blue birdie.

Scifi/action thriller in which the protagonist loses contact with their human comrades and has to cross a dangerous wasteland with only a ChatGPT-like tool to access crucial information
Beyond parody: "GOP states quit the program that fights voter fraud. Now they’re scrambling."
https://www.politico.com/news/2023/07/09/gop-states-program-voter-fraud-fight-00105252
GOP states quit the program that fights voter fraud. Now they’re scrambling.

The program, known as the Electronic Registration Information Center, was arguably the best nationwide tool states had to catch people trying to vote twice.

POLITICO

Can (text) LLMs reason about images, if they get a textual description of them? Yes, sort of!
Says Sherzod Hakimov, in "Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks" (ACL Findings)
https://arxiv.org/abs/2305.13782

2/4

Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks

Large language models have demonstrated robust performance on various language tasks using zero-shot or few-shot learning paradigms. While being actively researched, multimodal models that can additionally handle images as input have yet to catch up in size and generality with language-only models. In this work, we ask whether language-only models can be utilised for tasks that require visual input -- but also, as we argue, often require a strong reasoning component. Similar to some recent related work, we make visual information accessible to the language model using separate verbalisation models. Specifically, we investigate the performance of open-source, open-access language models against GPT-3 on five vision-language tasks when given textually-encoded visual information. Our results suggest that language models are effective for solving vision-language tasks even with limited samples. This approach also enhances the interpretability of a model's output by providing a means of tracing the output back through the verbalised image content.

arXiv.org

WIRED covered the recent partnership between GPT4 and "Be My Eyes", and included a nice discussion with Danna Gurari who has been leading the VizWiz workshop at CVPR for the past 5 years, where we challenge computer vision researchers to work on problems in accessibility -- and, yes, it stems back to the VizWiz paper from almost 13 years ago!

https://www.wired.com/story/ai-gpt4-could-change-how-blind-people-see-the-world/

AI Could Change How Blind People See the World

Assistive technology services are integrating OpenAI's GPT-4, using artificial intelligence to help describe objects and people.

WIRED