Artificial Intelligence: Reinforcement Learning in Python
Release date: 12/2024
Publisher: Udemy
Publisher website:
https://www.udemy.com/course/artificial-intelligence-reinforcement-learning-in-python/
Author: Lazy Programmer Inc.
Duration: 14:42:47
Type of material: Video tutorial
Language: English
Subtitles: English, German, Spanish, French, Italian, Portuguese
Description:
What you'll learn
- Apply gradient-based supervised machine learning methods to reinforcement learning
- Understand reinforcement learning on a technical level
- Understand the relationship between reinforcement learning and psychology
- Implement 17 different reinforcement learning algorithms
- Understand important foundations for OpenAI ChatGPT, GPT-4
Requirements
- Calculus (derivatives)
- Probability / Markov Models
- Numpy, Matplotlib
- Beneficial to have experience with at least a few supervised machine learning methods
- Gradient descent
- Good object-oriented programming skills
Description
- Ever wondered how AI technologies like OpenAI ChatGPT and GPT-4 really work? In this course, you will learn the foundations of these groundbreaking applications.
- When people talk about artificial intelligence, they usually don’t mean supervised and unsupervised machine learning.
- These tasks are pretty trivial compared to what we think of AIs doing - playing chess and Go, driving cars, and beating video games at a superhuman level.
- Reinforcement learning has recently become popular for doing all of that and more.
- Much like deep learning, a lot of the theory was discovered in the 70s and 80s, but it hasn’t been until recently that we’ve been able to observe firsthand the amazing results that are possible.
- In 2016 we saw Google’s AlphaGo beat the world champion in Go.
- We saw AIs playing video games like Doom and Super Mario.
- Self-driving cars have started driving on real roads with other drivers and even carrying passengers (Uber), all without human assistance.
- If that sounds amazing, brace yourself for the future because the law of accelerating returns dictates that this progress is only going to continue to increase exponentially.
- Learning about supervised and unsupervised machine learning is no small feat. To date I have over TWENTY FIVE (25!) courses just on those topics alone.
- And yet reinforcement learning opens up a whole new world. As you’ll learn in this course, the reinforcement learning paradigm is very different from both supervised and unsupervised learning.
- It’s led to new and amazing insights both in behavioral psychology and neuroscience. As you’ll learn in this course, there are many analogous processes when it comes to teaching an agent and teaching an animal or even a human. It’s the closest thing we have so far to a true artificial general intelligence.
What’s covered in this course?
- The multi-armed bandit problem and the explore-exploit dilemma
- Ways to calculate means and moving averages and their relationship to stochastic gradient descent
- Markov Decision Processes (MDPs)
- Dynamic Programming
- Monte Carlo
- Temporal Difference (TD) Learning (Q-Learning and SARSA)
- Approximation Methods (i.e. how to plug in a deep neural network or other differentiable model into your RL algorithm)
- How to use OpenAI Gym, with zero code changes
- Project: Apply Q-Learning to build a stock trading bot
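To make the first two topics in the list above concrete, here is a minimal sketch of an epsilon-greedy agent on a multi-armed bandit. It is not taken from the course: the function names (`incremental_mean`, `run_bandit`), the Gaussian reward model, and every parameter value are assumptions chosen for illustration. The update `estimate += (x - estimate) / n` reproduces the ordinary sample mean, and it has the same form as a gradient descent step on the squared error `0.5 * (x - estimate)**2` with learning rate `1 / n`.

```python
import numpy as np


def incremental_mean(samples):
    """Compute the sample mean one value at a time, without storing a sum."""
    estimate = 0.0
    for n, x in enumerate(samples, start=1):
        # Same form as an SGD step on 0.5 * (x - estimate)**2 with lr = 1 / n.
        estimate += (x - estimate) / n
    return estimate


def run_bandit(true_means, epsilon=0.1, steps=5000, seed=0):
    """Epsilon-greedy play on a bandit with Gaussian rewards (assumed model)."""
    rng = np.random.default_rng(seed)
    n_arms = len(true_means)
    estimates = np.zeros(n_arms)          # running mean reward per arm
    counts = np.zeros(n_arms, dtype=int)  # number of pulls per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            arm = int(rng.integers(n_arms))   # explore: pick a random arm
        else:
            arm = int(np.argmax(estimates))   # exploit: pick the best estimate
        reward = rng.normal(true_means[arm], 1.0)
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


# Hypothetical three-armed bandit; the true means are made up for the demo.
estimates, counts = run_bandit([0.2, 0.5, 0.8])
print("estimated means:", np.round(estimates, 2))
print("pulls per arm:  ", counts)
```

Replacing `1 / counts[arm]` with a small constant step size would turn the sample mean into an exponentially weighted moving average, which is the usual choice when the reward distribution drifts over time.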
If you’re ready to take on a brand new challenge, and learn about AI techniques that you’ve never seen before in traditional supervised machine learning, unsupervised machine learning, or even deep learning, then this course is for you.
See you in class!
"If you can't implement it, you don't understand it"
- Or as the great physicist Richard Feynman said: "What I cannot create, I do not understand".
- My courses are the ONLY courses where you will learn how to implement machine learning algorithms from scratch
- Other courses will teach you how to plug in your data into a library, but do you really need help with 3 lines of code?
- After doing the same thing with 10 datasets, you realize you didn't learn 10 things. You learned 1 thing, and just repeated the same 3 lines of code 10 times...
Suggested Prerequisites:
- Calculus
- Probability
- Object-oriented programming
- Python coding: if/else, loops, lists, dicts, sets
- Numpy coding: matrix and vector operations
- Linear regression
- Gradient descent
WHAT ORDER SHOULD I TAKE YOUR COURSES IN?:
- Check out the lecture "Machine Learning and AI Prerequisite Roadmap" (available in the FAQ of any of my courses, including the free Numpy course)
UNIQUE FEATURES
- Every line of code explained in detail - email me any time if you disagree
- No wasted time "typing" on the keyboard like other courses - let's be honest, nobody can really write code worth learning about in just 20 minutes from scratch
- Not afraid of university-level math - get important details about algorithms that other courses leave out
Who this course is for:
- Anyone who wants to learn about artificial intelligence, data science, machine learning, and deep learning
- Both students and professionals
Video format: MP4
Video: AVC, 1280x720, 16:9, 30.000 fps, 780 kb/s
Audio: AAC, 44.1 kHz, 128 kb/s, 2 channels
Changes
Version 2019/5, compared to 2019/1, adds 1 lesson and 3 minutes of running time. The size has increased by about 300 MB.
Version 2019/12, compared to 2019/5, is about 300 MB larger. It has probably gained about 10 lessons.
Version 2020/8, compared to 2019/12, adds at least 8 lessons and 2.5 hours.
Version 2021/5, compared to 2020/8, adds 1 section (3 lessons and about 2 hours).
Version 2023/11, compared to 2021/5, adds 2 lessons and 3 minutes of running time. The video quality has also increased from 720p to 1080p.
Version 2024/12 has the same number of lessons and the same duration as 2023/11, but the video quality has been reduced from 1080p to 720p, and subtitles have been added.
MediaInfo
General
Complete name : D:\1\Udemy - Artificial Intelligence Reinforcement Learning in Python (12.2024)\6 - Monte Carlo\3 -Monte Carlo Policy Evaluation in Code.mp4
Format : MPEG-4
Format profile : Base Media
Codec ID : isom (isom/iso2/avc1/mp41)
File size : 51.6 MiB
Duration : 7 min 52 s
Overall bit rate : 916 kb/s
Frame rate : 30.000 FPS
Writing application : Lavf58.12.100
Video
ID : 1
Format : AVC
Format/Info : Advanced Video Codec
Format profile : Main@L3.1
Format settings : CABAC / 4 Ref Frames
Format settings, CABAC : Yes
Format settings, Reference frames : 4 frames
Codec ID : avc1
Codec ID/Info : Advanced Video Coding
Duration : 7 min 52 s
Bit rate : 780 kb/s
Nominal bit rate : 3 000 kb/s
Width : 1 280 pixels
Height : 720 pixels
Display aspect ratio : 16:9
Frame rate mode : Constant
Frame rate : 30.000 FPS
Color space : YUV
Chroma subsampling : 4:2:0
Bit depth : 8 bits
Scan type : Progressive
Bits/(Pixel*Frame) : 0.028
Stream size : 43.9 MiB (85%)
Writing library : x264 core 148
Encoding settings : cabac=1 / ref=3 / deblock=1:0:0 / analyse=0x1:0x111 / me=umh / subme=6 / psy=1 / psy_rd=1.00:0.00 / mixed_ref=1 / me_range=16 / chroma_me=1 / trellis=1 / 8x8dct=0 / cqm=0 / deadzone=21,11 / fast_pskip=1 / chroma_qp_offset=-2 / threads=22 / lookahead_threads=3 / sliced_threads=0 / nr=0 / decimate=1 / interlaced=0 / bluray_compat=0 / constrained_intra=0 / bframes=3 / b_pyramid=2 / b_adapt=1 / b_bias=0 / direct=1 / weightb=1 / open_gop=0 / weightp=2 / keyint=60 / keyint_min=6 / scenecut=0 / intra_refresh=0 / rc_lookahead=60 / rc=cbr / mbtree=1 / bitrate=3000 / ratetol=1.0 / qcomp=0.60 / qpmin=0 / qpmax=69 / qpstep=4 / vbv_maxrate=3000 / vbv_bufsize=6000 / nal_hrd=none / filler=0 / ip_ratio=1.40 / aq=1:1.00
Codec configuration box : avcC
Audio
ID : 2
Format : AAC LC
Format/Info : Advanced Audio Codec Low Complexity
Codec ID : mp4a-40-2
Duration : 7 min 52 s
Bit rate mode : Constant
Bit rate : 128 kb/s
Channel(s) : 2 channels
Channel layout : L R
Sampling rate : 44.1 kHz
Frame rate : 43.066 FPS (1024 SPF)
Compression mode : Lossy
Stream size : 7.22 MiB (14%)
Default : Yes
Alternate group : 1