Reinforcement Learning (UCL Course on RL)

Lecture Review
작성자
관리자
작성일
2020-03-12 10:43
조회
4407
Study 기간: 2016. 09. ~ 2016. 12.

참여 인원: 지도교수 강필성, 박사과정 김준홍, 김창엽, 통합과정 김형석, 박민식, 석사과정 김보섭, 김해동, 류나현, 조수현, 서덕성, 박재선, 이기창, 모경현

대상 강의: http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html
일시 Chapter 발표자 발표자료
2016-09-26 1. Introduction to Reinforcement Learning (week1) 조수현 발표자료
2016-10-10 2. Markov Decision Processes (week2) 이기창 발표자료
2016-10-17 3. Planning by Dynamic Programming (week3) 김창엽 발표자료
2016-11-07 4. Model-Free Prediction (week4) 서덕성 발표자료
2016-11-21 5. Model-Free Control (week 5) 류나현 발표자료
2016-11-28 6. Value Function Approximation (week 6) 박재선 발표자료
2016-12-4 7. Policy Gradient Methods (week 7) 모경현 발표자료
2016-12-11 8. Integrating Learning and Planning (week 8) 박민식 발표자료
2016-12-28 9. Exploration and Exploitation (week 9) 김형석 발표자료
2016-12-28 10. Case Study: RL in Classic Games (week 10) 김보섭 발표자료
 2017-01-03 11. Deep Q-Network and AlphaGo 박재선 발표자료
전체 0

전체 554
번호 제목 작성자 작성일 추천 조회
공지사항
Paper Reviews 2019 Q3
관리자 | 2020.03.12 | 추천 0 | 조회 14839
관리자 2020.03.12 0 14839
공지사항
Paper Reviews 2019 Q2
관리자 | 2020.03.12 | 추천 0 | 조회 13589
관리자 2020.03.12 0 13589
공지사항
Paper Reviews 2019 Q1
관리자 | 2020.03.12 | 추천 0 | 조회 14536
관리자 2020.03.12 0 14536
551
[Paper Review] Programming Refusal with Conditional Activation Steering (10)
Sunmin Kim | 2026.03.10 | 추천 0 | 조회 138
Sunmin Kim 2026.03.10 0 138
550
[Paper Review] Towards a General Time Series Anomaly Detector with Adaptive Bottlenecks and Dual Adversarial Decoders (8)
Sunghun Lim | 2026.03.01 | 추천 0 | 조회 198
Sunghun Lim 2026.03.01 0 198
549
[Paper Review] Rethinking the Power of Timestamps for Robust Time Series Forecasting: A Global-Local Fusion Perspective (8)
Suyeon Shin | 2026.02.25 | 추천 0 | 조회 146
Suyeon Shin 2026.02.25 0 146
548
[Paper Review] Recent Research Trends Foundation Model for Visual Anomaly Detection (10)
Jaehyuk Heo | 2026.02.12 | 추천 0 | 조회 341
Jaehyuk Heo 2026.02.12 0 341
547
[Paper Review] Vision-based and Multimodal Approaches for Time Series Analysis (8)
Hyeongwon Kang | 2026.02.10 | 추천 0 | 조회 310
Hyeongwon Kang 2026.02.10 0 310
546
[Paper Review] Introduction to Neural Operator (10)
Hankyeol Kim | 2026.02.03 | 추천 0 | 조회 380
Hankyeol Kim 2026.02.03 0 380
545
[Paper Review] Enhancing Time Series Forecasting through Selective Representation Spaces: A Patch Perspective (12)
Sieon Park | 2026.01.29 | 추천 0 | 조회 440
Sieon Park 2026.01.29 0 440
544
[Paper Review] ELFS: Label-Free Coreset Selection with Proxy Training Dynamics (12)
Subeen Cha | 2026.01.28 | 추천 0 | 조회 312
Subeen Cha 2026.01.28 0 312
543
[Paper Review] Model Merging for Continual Learning (11)
Hun Im | 2026.01.24 | 추천 0 | 조회 308
Hun Im 2026.01.24 0 308
542
[Paper Review] Selective Learning for Deep Time Series Forecasting (13)
Jinwoo Park | 2026.01.24 | 추천 0 | 조회 438
Jinwoo Park 2026.01.24 0 438

Data Science & Business Analytics Lab.
Department of Industrial Engineering, College of Engineering,
Seoul National University

Contact Us

  • 강필성 교수 (pilsung_kang@snu.ac.kr)
    서울특별시 관악구 관악로 1 서울대학교 공과대학 39동 301호 
  • 대학원 연구실 (총무 김재희: jaehee_kim@snu.ac.kr)
    서울특별시 관악구 관악로 1 서울대학교 공과대학 39동 411호