- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 2021-06-24 15:17:23
Deterministic environment
An environment is said to be deterministic when the outcome of an action is known exactly from the current state. For instance, in a chess game, we know the exact board position that results from moving any piece.
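As a minimal sketch of this definition, consider a hypothetical one-dimensional grid world (not taken from the book): states are the integers 0 to 4, and the actions are -1 (move left) and +1 (move right). The names `N_STATES` and `step` are illustrative assumptions. Because the transition function contains no randomness, the same state and action always produce the same next state.

```python
# Hypothetical 1-D grid world used only for illustration.
N_STATES = 5  # states are the integers 0..4


def step(state: int, action: int) -> int:
    """Deterministic transition: (state, action) maps to exactly one next state."""
    if action not in (-1, 1):
        raise ValueError("action must be -1 (left) or +1 (right)")
    # Clamp to the grid so the agent cannot move past either end.
    return max(0, min(N_STATES - 1, state + action))


# Repeating the same move from the same state yields a single possible outcome.
outcomes = {step(2, 1) for _ in range(100)}
print(outcomes)  # {3}
```

Collecting the results in a set makes the property visible: however many times the move is repeated, only one outcome ever appears.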