Pierluca D'Oro

83 Followers
45 Following
19 Posts
PhD student at Mila, studying #RL.
From Sicily ๐ŸŒ‹๐ŸŒŠ
Websitehttps://proceduralia.github.io/
Google Scholarhttps://scholar.google.com/citations?user=AuVp7pkAAAAJ&hl=it
Twitterhttps://twitter.com/proceduralia

#Introduction v2

I'm generally interested in how the intersection of multi-agent learning, open-endedness, and human-data, can help us build agents with emergent capabilities and reshape validation for autonomous systems. I love batched simulators, RL, autonomous vehicles, and transportation systems (and sci-fi).

I'm also a professor at NYU Tandon CUE and a research scientist at Apple SPG.

It's the "applications!" time of the year, so let me link to these insightful and oh-so-helpful (and witty) slides by Rocco Servedio on how to write a research statement academic positions and postdocs, from the 2021 Learning Theory Alliance mentoring workshop:
https://let-all.com/assets/slides/How-to-COLT-Rocco.pdf

Lots of good stuff, meaningful advice, and Herman Melville.
#academia #academicjobmarket #researchstatement #hermanmelville

I've now created @tmlrpub for published papers and @tmlrcert for certifications at TMLR.

This place starts feeling like home ๐Ÿก

Asking for #rl opinions (#2).

What is deep reinforcement for you? Is it just RL with neural networks?

If so, should we call previous work from the 80s/90s deep RL? If not, what are the peculiar features of deep RL?

what's the weirdest thing you've stumbled upon in #RL #reinforcementlearning ?

i'll start:

if using neural nets* you don't actually need any rewards to train an agent optimally on episodic cartpole.

* or positive initializations

Asking for #rl opinions.

Is a value function a model in the RL sense? Why? Why not?

Feels like the difference between model-based and value-based methods is getting more and more arbitrary.

#introduction

Hi! I'm Pierluca, a PhD student at #mila and a visiting researcher at #meta in Montreal. I work in #reinforcementlearning ( #rl ).

I've been excited about the idea of having models of the dynamics that are maximally useful for learning a control policy.

I have also stumbled upon some sharp edges of the interaction of RL and #deeplearning.

I'm from #sicily and I'm a musician. I wanted to do AI for songwriting but then discovered the beauty of fundamental research!

@ashishgaurav_13 (forgot i'm no longer on mathstodon.xyz, so that's why my latex wasn't rendering)... @thegradient do you think this is a feature that could be added? ๐Ÿ˜„