Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Apps
llama.cpp
LM Studio
Jan
Draw Things
DiffusionBee
JoyFusion
vLLM
Ollama
MLX LM
Docker Model Runner
Lemonade
SGLang
Unsloth
Pi
Inference Providers
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
deep-rl-course
Inference Endpoints
text-generation-inference
Eval Results (legacy)
text-embeddings-inference
4-bit precision
custom_code
Merge
8-bit precision
Mixture of Experts
Carbon Emissions
Eval Results

Models

2,077
Full-text search
Active filters: deep-rl-course

gusainanurag58/ppo-LunarLander-v2

Reinforcement Learning • Updated 8 days ago • 38

Kaushik23/LunarLander-v2

Reinforcement Learning • Updated 8 days ago

HamzaChera/ppo-CartPole-v1

Reinforcement Learning • Updated 4 days ago

HamzaChera/cleanRL-ppo-LunarLander-v2

Reinforcement Learning • Updated 4 days ago

blackcodetavern/ppo-CartPole-Other

Reinforcement Learning • Updated about 22 hours ago

zhaojizhang/PPO-LunarLander-v2

Reinforcement Learning • Updated about 20 hours ago

brk0zt/LunarLander-v2

Reinforcement Learning • Updated about 11 hours ago
  • Previous
  • 1
  • ...
  • 68
  • 69
  • 70
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs