Contact energy based hindsight experience prioritization

buir.contributor.authorÖğüz, Salih Özgür
buir.contributor.orcidÖğüz, Salih Özgür|0000-0001-8723-1837
dc.citation.epage5440
dc.citation.spage5434
dc.contributor.authorSayar, Erdi
dc.contributor.authorBing, Zhenshan
dc.contributor.authorD'Eramo, Carlo
dc.contributor.authorÖğüz, Salih Özgür
dc.contributor.authorKnoll, Alois
dc.coverage.spatialYokohama, JAPAN
dc.date.accessioned2025-02-22T08:50:32Z
dc.date.available2025-02-22T08:50:32Z
dc.date.issued2024-08-08
dc.departmentDepartment of Computer Engineering
dc.descriptionConference Name: IEEE International Conference on Robotics and Automation (ICRA)
dc.descriptionDate of Conference:13-17 May 2024
dc.description.abstractMulti-goal robot manipulation tasks with sparse rewards are difficult for reinforcement learning (RL) algorithms due to the inefficiency in collecting successful experiences. Recent algorithms such as Hindsight Experience Replay (HER) expedite learning by taking advantage of failed trajectories and replacing the desired goal with one of the achieved states so that any failed trajectory can be utilized as a contribution to learning. However, HER uniformly chooses failed trajectories, without taking into account which ones might be the most valuable for learning. In this paper, we address this problem and propose a novel approach Contact Energy Based Prioritization (CEBP) to select the samples from the replay buffer based on rich information due to contact, leveraging the touch sensors in the gripper of the robot and object displacement. Our prioritization scheme favors sampling of contact-rich experiences, which are arguably the ones providing the largest amount of information. We evaluate our proposed approach on various sparse reward robotic tasks and compare it with the state-of-the-art methods. We show that our method surpasses or performs on par with those methods on robot manipulation tasks. Finally, we deploy the trained policy from our method to a real Franka robot for a pick-and-place task. We observe that the robot can solve the task successfully. The videos and code are publicly available at: https://erdiphd.github.io/HER force/.
dc.description.provenanceSubmitted by Aleyna Demirkıran (aleynademirkiran@bilkent.edu.tr) on 2025-02-22T08:50:32Z No. of bitstreams: 1 Contact_Energy_Based_Hindsight_Experience_Prioritization (1).pdf: 1888322 bytes, checksum: 3794d2e58d21881d7828eda8a4c9fd12 (MD5)en
dc.description.provenanceMade available in DSpace on 2025-02-22T08:50:32Z (GMT). No. of bitstreams: 1 Contact_Energy_Based_Hindsight_Experience_Prioritization (1).pdf: 1888322 bytes, checksum: 3794d2e58d21881d7828eda8a4c9fd12 (MD5) Previous issue date: 2024-08-08en
dc.identifier.doi10.1109/ICRA57147.2024.10610910
dc.identifier.isbn979-8-3503-8457-4
dc.identifier.urihttps://hdl.handle.net/11693/116619
dc.language.isoEnglish
dc.publisherIEEE
dc.relation.ispartofseriesBook Series
dc.relation.isversionofhttps://dx.doi.org/10.1109/ICRA57147.2024.10610910
dc.source.title2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024
dc.subjectTraining
dc.subjectCodes
dc.subjectCatalysts
dc.subjectTactile sensors
dc.subjectReinforcement learning
dc.subjectTrajectory
dc.subjectFriction
dc.titleContact energy based hindsight experience prioritization
dc.typeConference Paper

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Contact_Energy_Based_Hindsight_Experience_Prioritization (1).pdf
Size:
1.8 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: