This research validates a weekly re-trained DRL agent, showing it outperforms static models & Black-Scholes for practical American option hedging. https://hackernoon.com/validating-hyperparameters-and-a-weekly-re-training-strategy-for-drl-option-hedging #deepreinforcementlearning
Validating Hyperparameters and a Weekly Re-training Strategy for DRL Option Hedging | HackerNoon


This methodology details how to train and test DRL agents for American option hedging, introducing a novel weekly re-training strategy using Chebyshev pricing. https://hackernoon.com/dont-just-train-your-ai-re-train-it-the-weekly-workout-plan-for-a-smarter-option-hedge #deepreinforcementlearning
Don't Just Train Your AI, Re-Train It: The Weekly Workout Plan for a Smarter Option Hedge | HackerNoon


This review of DRL hedging literature highlights the need for hyperparameter analysis, especially for real-world American option applications. https://hackernoon.com/avoiding-the-pitfalls-a-guide-to-the-current-state-of-drl-option-hedging-research #deepreinforcementlearning
Avoiding the Pitfalls: A Guide to the Current State of DRL Option Hedging Research | HackerNoon


This paper makes Deep Reinforcement Learning practical for hedging American options by optimizing hyperparameters and using a weekly re-training strategy. https://hackernoon.com/how-weekly-ai-training-is-beating-a-nobel-prize-winning-formula #deepreinforcementlearning
How Weekly AI Training Is Beating a Nobel Prize-Winning Formula | HackerNoon


RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning (2023) — https://kzakka.com/robopianist/#demo
#HackerNews #RoboPianist #DeepReinforcementLearning #PianoAI #MachineLearning #Robotics #2023

Autonomy Talks - Georgia Chalvatzaki: Shaping #Robotic Assistance through Structured #Robot #Learning: https://www.youtube.com/watch?v=e0aQC3C8P7w #robotics #machinelearning

Around 12:30 they present training a model-free #MDP #deepreinforcementlearning agent with guidance from a model-based #ai #planner #aiplanner. The guidance drastically boosts training.

The general idea is to guide an implicit model using a model-based approximation, and it also works for assembly tasks, computer vision, pick and place…

Last but not least came Tekgul & Asokan's "FLARE: Fingerprinting Deep Reinforcement Learning Agents using Universal Adversarial Masks", a fingerprinting approach that is robust to model modification attacks. (https://www.acsac.org/2023/program/final/s264.html) 4/4
#MachineLearningSecurity #DeepReinforcementLearning #SecurityInAI
ACSAC2023 Program – powered by OpenConf

Mini quadcopter learns to fly in seconds | heise online
https://heise.de/-9623443 #DeepReinforcementLearning #ReinforcementLearning #RL
Quadcopter learns to fly in seconds through curriculum learning

Researchers teach a quadcopter to fly. Thanks to an optimized curriculum method, training takes only a few seconds.

heise online
AI robot: master of the labyrinth dexterity game

The world of robotics and artificial intelligence (AI) has reached a new milestone with an AI robot that is turning heads.

Tarnkappe.info
Will the next generation of #LLM come from #DeepMind?
https://www.wired.com/story/google-deepmind-demis-hassabis-chatgpt/
They may have a shot at it given their expertise in #DeepReinforcementLearning. If their #AI can plan tasks with solid logical grounds, can't they also produce solid explanations?
Google DeepMind CEO Demis Hassabis Says Its Next Algorithm Will Eclipse ChatGPT

The company is working on a system called Gemini that will draw on techniques that powered AlphaGo to a historic victory over a Go champion in 2016.

WIRED