Pierluca D'Oro


#Introduction v2

I'm generally interested in how the intersection of multi-agent learning, open-endedness, and human data can help us build agents with emergent capabilities and reshape validation for autonomous systems. I love batched simulators, RL, autonomous vehicles, and transportation systems (and sci-fi).

I'm also a professor at NYU Tandon CUE and a research scientist at Apple SPG.

It's "applications!" time of the year, so let me link to these insightful and oh-so-helpful (and witty) slides by Rocco Servedio on how to write a research statement for academic positions and postdocs, from the 2021 Learning Theory Alliance mentoring workshop:
https://let-all.com/assets/slides/How-to-COLT-Rocco.pdf

Lots of good stuff, meaningful advice, and Herman Melville.
#academia #academicjobmarket #researchstatement #hermanmelville

@antirez He who plays alone wins.
@fabian @tmlrpub @tmlrcert Thank you so much for creating these, they help a lot in setting the right vibe!

I've now created @tmlrpub for published papers and @tmlrcert for certifications at TMLR.

This place starts feeling like home 🏡

@psc @jhamrick Thanks for the comments, Pablo! There is indeed a close relationship between value-aware models and bisimulation, and this is an interesting perspective on it!

@psc I'd say the huge ones are various forms of ad placement and online bidding, through multi-armed bandits (MABs) and related algorithms.

The interesting bit is that these might be among the most lucrative applications right now by far, despite not using any deep networks!
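To make the ad-placement example concrete, here's a minimal sketch of UCB1, a classic MAB algorithm of the kind used in that setting. The ads, their click-through rates, and the function name are all hypothetical, just for illustration:

```python
import math
import random

def ucb1(true_ctrs, horizon=10000, seed=0):
    """Minimal UCB1 bandit: each arm is an ad with an unknown click-through rate.

    true_ctrs: hypothetical per-ad click probabilities (unknown to the learner).
    Returns the empirical click-rate estimates and show counts per ad.
    """
    rng = random.Random(seed)
    n_arms = len(true_ctrs)
    counts = [0] * n_arms    # times each ad was shown
    values = [0.0] * n_arms  # running mean reward (clicks) per ad

    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1  # show each ad once to initialize
        else:
            # pick the ad maximizing empirical mean + exploration bonus
            arm = max(range(n_arms),
                      key=lambda a: values[a] + math.sqrt(2 * math.log(t) / counts[a]))
        reward = 1.0 if rng.random() < true_ctrs[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean

    return values, counts
```

No deep network anywhere: the whole method is a table of means plus a confidence bonus, which is exactly why it scales so cheaply to billions of ad impressions.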

Asking for #rl opinions (#2).

What is deep reinforcement learning for you? Is it just RL with neural networks?

If so, should we call previous work from the 80s/90s deep RL? If not, what are the peculiar features of deep RL?

@saiborg Yes! But shouldn't a value function have a sense of the evolution of a system before making a prediction about the return?

@jhamrick Yes! I was implicitly referring to value-equivalent/value-aware models.

Since they are not constrained to be similar to the actual transition model, I sometimes wonder if it is more natural to think of them simply as inducing particular inductive biases (or, more precisely, learning architectures) for value-based RL, rather than as part of model-based methods.