Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Juntao Dai's picture
1 4 8

Juntao Dai

calico-1226
panjinhao0320's profile picture Gaie's profile picture mickelliu's profile picture
·
  • calico-1226

AI & ML interests

RLHF

Organizations

OmniSafeAI's profile picture PKU-Alignment's profile picture Physis AI's profile picture

upvoted a collection 3 months ago

AgentDoG

Collection
A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated about 24 hours ago • 107
upvoted a collection over 1 year ago

SafeSora

Collection
Towards Safety Alignment of Text2Video Generation • 4 items • Updated Aug 15, 2024 • 2
upvoted a paper over 2 years ago

Safe RLHF: Safe Reinforcement Learning from Human Feedback

Paper • 2310.12773 • Published Oct 19, 2023 • 28
upvoted a paper almost 3 years ago

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

Paper • 2307.04657 • Published Jul 10, 2023 • 6
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs