All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
3:27
New short course on Reinforcement Learning from Human Feedback!
…
7.3K views
Dec 13, 2023
Facebook
Andrew Ng
What Is Reinforcement Learning From Human Feedback (RLHF)? | I
…
Nov 10, 2023
ibm.com
6:06:21
【6小时教程】完整 LLM 实战课程:从 Transformer 到 RLHF 全流程
3.4K views
5 months ago
bilibili
AIDeepCoder
20:28
RLHF: Training Language Models to Follow Instructions with Human F
…
2.2K views
Mar 22, 2024
YouTube
DataMListic
35:28
LLM后训练SFT、RLHF原理全面解析
428 views
5 months ago
bilibili
AI技术新视界
3:14:37
RLHF from scratch, step-by-step, in code
2.8K views
8 months ago
YouTube
Ashwani Kumar
19:39
Reinforcement Learning, RLHF, & DPO Explained
16.2K views
Jun 12, 2024
YouTube
Mark Hennings
1:01:53
LLM: Pretraining, Instruction fine-tuning and RLHF
6.3K views
Jul 31, 2023
YouTube
YanAITalk
6:18
What is LLM RLHF ?
424 views
5 months ago
YouTube
New Machina
11:29
Reinforcement Learning from Human Feedback (RLHF) Explained
78.8K views
Aug 7, 2024
YouTube
IBM Technology
1:23:59
OpenRLHF:大规模分布式RLHF训练系统介绍
3.8K views
Sep 1, 2024
bilibili
NICE学术
25:03
Reinforcement Learning with Human Feedback (RLHF) | Reinforcement
…
1.9K views
9 months ago
YouTube
Unfold Data Science
1:19:41
直观理解大模型预训练和微调!四大LLM微调方法,RLHF基于人类反馈
…
2.4K views
Oct 22, 2024
bilibili
转行AI大模型
53:07
Reinforced Self-Training (ReST) for Language Modeling (Paper Explai
…
34.5K views
Sep 3, 2023
YouTube
Yannic Kilcher
4:00
RLHF Explained: How We Train AI to Match Human Values
145 views
2 months ago
YouTube
CodeLucky
1:31
吹爆!全网最快30分钟实现从零复现RLHF训练法!!代码实战篇【附源
…
1.2K views
Nov 11, 2024
bilibili
大模型入门学习中心
0:57
How RLHF Creates Human-Like AI
2.8K views
Feb 7, 2025
YouTube
SCALER
LLM の LoRA / RLHF によるファインチューニング用のツールキットま
…
May 13, 2023
note(ノート)
npaka
1:47
Unlock the Power of Generative AI with RLHF Powered by Appen
17.2K views
Mar 31, 2023
YouTube
Appen
10:17
Reinforcement Learning through Human Feedback - EXPLAINED! |
…
29K views
Dec 11, 2023
YouTube
CodeEmporium
Open-sourcing RLHF with LoRA for LLaMA-3.1 in PyTorch | Arjun Gup
…
9K views
2 months ago
linkedin.com
2:15:13
Reinforcement Learning from Human Feedback explained with
…
67.1K views
Feb 27, 2024
YouTube
Umar Jamil
9:08
Reinforcement Learning from Human Feedback Explained (and
…
4.9K views
Dec 13, 2023
YouTube
What's AI by Louis-François Bouchard
6:31
Reinforcement Learning: ChatGPT and RLHF
23.7K views
Aug 14, 2023
YouTube
Graphics in 5 Minutes
1:00:38
Reinforcement Learning from Human Feedback: From Zero to c
…
187.5K views
Dec 13, 2022
YouTube
HuggingFace
1:53
RLHF训练法从零复现,代码实战,大语言模型训练
21.3K views
May 8, 2024
bilibili
蓝斯诺特
18:55
RLHF - Llama 3.1 8B | Alpaca Dataset | LoRA | PyTorch | On con
…
111 views
2 months ago
YouTube
ARJUNTHEPROGRAMMER
53:40
Lec 07 | Reinforcement Learning from Human Feedback: Part 01
741 views
5 months ago
YouTube
LCS2
7:51
Generative Reward Models: Merging the Power of RLHF and RLAIF for
…
2.1K views
Oct 27, 2024
YouTube
AI Papers Academy
38:24
Proximal Policy Optimization (PPO) - How to train Large Language Mod
…
80.3K views
Jan 24, 2024
YouTube
Serrano.Academy
See more videos
More like this
Feedback