Mastodawn

Pierluca D'Oro Dec 18, 2022

Eugene Vinitsky Dec 17, 2022

I'm generally interested in how the intersection of multi-agent learning, open-endedness, and human data can help us build agents with emergent capabilities and reshape validation for autonomous systems. I love batched simulators, RL, autonomous vehicles, and transportation systems (and sci-fi).

I'm also a professor at NYU Tandon CUE and a research scientist at Apple SPG.

Pierluca D'Oro Nov 19, 2022

Clément Canonne Nov 18, 2022

It's the "applications!" time of the year, so let me link to these insightful and oh-so-helpful (and witty) slides by Rocco Servedio on how to write a research statement for academic positions and postdocs, from the 2021 Learning Theory Alliance mentoring workshop:
https://let-all.com/assets/slides/How-to-COLT-Rocco.pdf

Lots of good stuff, meaningful advice, and Herman Melville.
#academia #academicjobmarket #researchstatement #hermanmelville

Pierluca D'Oro Nov 12, 2022

Fabian Pedregosa Nov 10, 2022

I've now created @tmlrpub for published papers and @tmlrcert for certifications at TMLR.

This place starts feeling like home 🏡

Pierluca D'Oro Nov 9, 2022

Asking for #rl opinions (#2).

What is deep reinforcement learning for you? Is it just RL with neural networks?

If so, should we call previous work from the 80s/90s deep RL? If not, what are the peculiar features of deep RL?

Pierluca D'Oro Nov 9, 2022

Pablo Samuel Castro Nov 7, 2022

what's the weirdest thing you've stumbled upon in #RL #reinforcementlearning ?

i'll start:

if using neural nets* you don't actually need any rewards to train an agent optimally on episodic cartpole.

* or positive initializations

Pierluca D'Oro Nov 8, 2022

Asking for #rl opinions.

Is a value function a model in the RL sense? Why? Why not?

Feels like the difference between model-based and value-based methods is getting more and more arbitrary.

Pierluca D'Oro Nov 8, 2022

#introduction

Hi! I'm Pierluca, a PhD student at #mila and a visiting researcher at #meta in Montreal. I work in #reinforcementlearning ( #rl ).

I've been excited about the idea of having models of the dynamics that are maximally useful for learning a control policy.

I have also stumbled upon some sharp edges of the interaction of RL and #deeplearning.

I'm from #sicily and I'm a musician. I wanted to do AI for songwriting but then discovered the beauty of fundamental research!

Pierluca D'Oro Nov 8, 2022

Pablo Samuel Castro Nov 8, 2022

@ashishgaurav_13 (forgot i'm no longer on mathstodon.xyz, so that's why my latex wasn't rendering)... @thegradient do you think this is a feature that could be added? 😄

Website: https://proceduralia.github.io/
Google Scholar: https://scholar.google.com/citations?user=AuVp7pkAAAAJ&hl=it
Twitter: https://twitter.com/proceduralia