Dr. Pedro Rodriguez

56 Followers
71 Following
4 Posts
Researcher at MetaAI FAIR Labs
CS PhD from UMD 🐢, CLIP Lab
CS Undergraduate from UC Berkeley CS 🐻
Interests: Natural Language Processing, Question Answering & Information Retrieval, Retrieval-augmented Language Models, Evaluation Methodology.
He/Him 🏳️‍🌈
@_dmh I believe there is a 10M token track as well, but in general the lower scale makes it easier to do scaling studies since its cheaper. I could easily see some cool papers coming out of the workshop.

@_dmh The BabyLM challenge has participants training LMs on small data, which seems like a good match (goal is <100M words). It is running this spring/summer, so you could look at the results/papers or participate!

https://babylm.github.io/

@AkariAsai Related, you can `git clone` overleaf repositories and if you use VS Code, edit with LaTeX Workshop + Grammarly extension.
As my first post here, I'm making the much-belated announcement of my new 3-member research group (Kai, Suki, and myself)! The WFH office has plenty of places to hang out 😹, with gourmet meals served every day! Occasionally, collaborators will not-so-subtly tell me it's time to take a break and play/cuddle/pet :).