Michael Dennis

84 Followers
115 Following
23 Posts
PhD student at the Center for Human-Compatible AI at UC Berkeley working on Game Theory and Reinforcement Learning

If anyone here on #SigmoidSocial is interested in understanding the effects our #AI or #RL systems have on society, I would encourage you to check out the communities Tom is organizing (two of which are linked below)

The design decisions and research directions of today, are fixtures and industries of tomorrow. It's always great to find spaces where these considerations are discussed openly and seriously

[https://sigmoid.social/@tkgilbert/109547206519360129 | CHI workshop]
[https://sigmoid.social/@tkgilbert/109538077146112536 | PERLS reading group]

tkgilbert (@[email protected])

Do you have policy ideas for societal-scale AI? AND want to design for new capabilities? Consider submitting to our #CHI2023 workshop: "Designing Platform Technology and Policy Simultaneously". More info here: http://designpolicy.one/ We are soliciting short position papers (1-2 pages) and short research papers (4-6 pages). Deadline is February 23, 2023--plenty of time to put something together!

Sigmoid Social

Do you have policy ideas for societal-scale AI? AND want to design for new capabilities? Consider submitting to our #CHI2023 workshop: "Designing Platform Technology and Policy Simultaneously".

More info here: http://designpolicy.one/

We are soliciting short position papers (1-2 pages) and short research papers (4-6 pages). Deadline is February 23, 2023--plenty of time to put something together!

Design Policy · A CHI'23 workshop

#introduction

I am a scientist at Meta AI in NYC and study machine learning and optimization, recently involving reinforcement learning, control, optimal transport, and geometry. On social media, I enjoy finding and boosting interesting content from the original authors on these topics

I made this small animation with my recent project on optimal transport that connects continuous structures in the world. The source code to reproduce this and other examples is online at https://github.com/facebookresearch/w2ot

GitHub - facebookresearch/w2ot: Euclidean Wasserstein-2 optimal transportation

Euclidean Wasserstein-2 optimal transportation. Contribute to facebookresearch/w2ot development by creating an account on GitHub.

GitHub

Reminds me of:
DeepRL that matters
DeepRL at the Edge of the Statistical Precipice
IPPO and MAPPO
Measuring and Characterizing Generalization in Deep RL
The teammate transfer results in PSRO
(Probably many others, LMK if I’ve forgotten something!)

Seems like a reoccurring theme that should be taken more seriously if we care about real sustained progress in MARL

[https://twitter.com/michaeld1729/status/1604855967879172096?s=46&t=wxGlb03ZPSMgGiSpSUli_g | XP]
Very interesting result, showing that one can do very well on the very popular SMAC benchmark without looking at the observations!

This should be a reminder that progress in RL and MARL is hard to measure well, and even when progress is being made by a well defined metric, it is often not for the reasons we think!

#MARL #RL #reinforcementlearning

Michael Dennis on Twitter

“Great experiment! Goes to show that evaluating RL, especially MARL, can be very difficult to do well.”

Twitter
Now that Musk destroyed Twitter, I am excited to (re)announce the (re)launch of PERLS for 2023! For those interested in participating, I am planning to purchase physical book copies of The Oxford Handbook of Ethics of AI and What We Owe The Future. If you wish to take part, @ me. I will have the book(s) sent straight to you. It'll be fun!

Did that notify you @dhadfieldmenell, or do I have to tag you specifically for that?

Wondering if quote-retoot is a first-order functionality here.

Anyone on #SigmoidSocial experimenting with open source recommendation systems for mastodon? With all the #ai folks here, this would be the place to experiment!

https://mastodon.mit.edu/@dhadfieldmenell/109537318540205247

Dylan Hadfield-Menell (@[email protected])

@[email protected] @[email protected] I don’t think there is anything for this. It seems like this is the right time to be building up options. An open source recommendation system could be very interesting

mastodon.mit.edu

@natolambert @michael_dennis

I don’t think there is anything for this. It seems like this is the right time to be building up options. An open source recommendation system could be very interesting