DrEureka takes a robotic task description and uses an LLM to generate software implementations for a reward function that measures success in that task.
View Article on VentureBeat
AI,Automation,Programming & Development,AI, ML and Deep Learning,category-/Science/Engineering & Technology/Robotics,domain randomization (DR),DrEureka,Eureka Park,fine tuning,language models,large language models,LLMs,Nvidia,reinforcement learning,robotics,robotics systems,sim-to-real,sim-to-real-transfer,simulation,University of Pennsylvania,University of Texas at Austin
reinforcement learning