상세정보 | 동명대학교 중앙도서관

메뉴 바로가기
본문 바로가기(skip to content)
검색 바로가기
도서관정보 바로가기

메세지가 없습니다

전체메뉴 닫기

상세정보

검색상세정보

Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow

Ravichandiran, Sudharsan

책이미지
서평쓰기: 서평쓰기

책이미지

QRcode

QRcode 사용방법

QRcode 사용방법

QR코드 정의: 사각형의 가로세로 격자무늬에 다양한 정보를 담고 있는 2차원(매트릭스)형식의 코드로 스마트폰으로 QR코드를 스캔하면 각종 정보를 제공 받을 수 있습니다.
QR코드 인식프로그램: 스마트폰 마켓에 들어가면 'QR코드 스캔 프로그램'을 검색해 무료 어플을 다운받아 사용하시면 됩니다.
도서정보 QR코드 이용하기: 도서 상세정보에서 QR코드를 스캔하면, 모바일 웹페이지로 연결되어 해당 도서의 상세정보 및 소장정보 등을 확인 할 수 있습니다.

상세정보
자료유형	E-BOOK
서명/저자사항	Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow/ Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani and Yang Wenzhuo.
개인저자	Ravichandiran, Sudharsan,author, Saito, Sean,author, Shanmugamani, Rajalingappaa,author, Wenzhuo, Yang,author,
발행사항	Birmingham: Packt Publishing, Limited, 2019.
형태사항	1 online resource (484 pages).
기타형태 저록	Print version: Ravichandiran, Sudharsan. Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow. Birmingham : Packt Publishing, Limited, ©2019 9781838649777
ISBN	1838640142 9781838640149
일반주기	Implementation of the Atari emulator
내용주기	Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Reinforcement Learning; What is RL?; RL algorithm; How RL differs from other ML paradigms; Elements of RL; Agent; Policy function; Value function; Model; Agent environment interface; Types of RL environment; Deterministic environment; Stochastic environment; Fully observable environment; Partially observable environment; Discrete environment; Continuous environment; Episodic and non-episodic environment; Single and multi-agent environment; RL platforms OpenAI Gym and UniverseDeepMind Lab; RL-Glue; Project Malmo; ViZDoom; Applications of RL; Education; Medicine and healthcare; Manufacturing; Inventory management; Finance; Natural Language Processing and Computer Vision; Summary; Questions; Further reading; Chapter 2: Getting Started with OpenAI and TensorFlow; Setting up your machine; Installing Anaconda; Installing Docker; Installing OpenAI Gym and Universe; Common error fixes; OpenAI Gym; Basic simulations; Training a robot to walk; OpenAI Universe; Building a video game bot; TensorFlow; Variables, constants, and placeholders; Variables ConstantsPlaceholders; Computation graph; Sessions; TensorBoard; Adding scope; Summary; Questions; Further reading; Chapter 3: The Markov Decision Process and Dynamic Programming; The Markov chain and Markov process; Markov Decision Process; Rewards and returns; Episodic and continuous tasks; Discount factor; The policy function; State value function; State-action value function (Q function); The Bellman equation and optimality; Deriving the Bellman equation for value and Q functions; Solving the Bellman equation; Dynamic programming; Value iteration; Policy iteration Solving the frozen lake problemValue iteration; Policy iteration; Summary; Questions; Further reading; Chapter 4: Gaming with Monte Carlo Methods; Monte Carlo methods; Estimating the value of pi using Monte Carlo; Monte Carlo prediction; First visit Monte Carlo; Every visit Monte Carlo; Let's play Blackjack with Monte Carlo; Monte Carlo control; Monte Carlo exploration starts; On-policy Monte Carlo control; Off-policy Monte Carlo control; Summary; Questions; Further reading; Chapter 5: Temporal Difference Learning; TD learning; TD prediction; TD control; Q learning Solving the taxi problem using Q learningSARSA; Solving the taxi problem using SARSA; The difference between Q learning and SARSA; Summary; Questions; Further reading; Chapter 6: Multi-Armed Bandit Problem; The MAB problem; The epsilon-greedy policy; The softmax exploration algorithm; The upper confidence bound algorithm; The Thompson sampling algorithm; Applications of MAB; Identifying the right advertisement banner using MAB; Contextual bandits; Summary; Questions; Further reading; Chapter 7: Playing Atari Games; Introduction to Atari games; Building an Atari emulator; Getting started
요약	Reinforcement learning and deep reinforcement learning are the trending and most promising branches of artificial intelligence. This Learning Path will enable you to master not only the basic reinforcement learning algorithms but also the advanced deep reinforcement learning algorithms and their limitations.
일반주제명	Python (Computer program language) Reinforcement learning.
언어	영어
바로가기	URL

소장정보

서비스 이용안내

서가에없는책
도서배달서비스
인쇄

메세지가 없습니다
No.	등록번호	청구기호	소장처	도서상태	반납예정일	예약	서비스	매체정보
1	EB89796	URL 참조006.31	중앙도서관	대출가능
2	EB89797	URL 참조006.31	중앙도서관	대출가능
3	EB89801	URL 참조006.31	중앙도서관	대출가능
4	EB89799	URL 참조006.31	중앙도서관	대출가능
5	EB89800	URL 참조006.31	중앙도서관	대출가능
6	EB89798	URL 참조006.31	중앙도서관	대출가능

청구기호 Browsing

이 분야 인기자료

가짜뉴스의 사회학 = the sociology of f...

박창호

서울경제경영, 2024

비밀의 도서관 : 호메로스에서 케인스까지 99권으로 읽...

티얼, 올리버

생각정거장, 2017

우리가 사랑한 부전도서관 : 부전도서관 휴관 전말, 여...

이양숙

해피북미디어, 2024

그곳에 책이 있었다

파이퍼, 앤드루

책읽는수요일: 한국학술정보, 2014

부석사

배병선

대원사, 2000

서평 (0 건)

*주제와 무관한 내용의 서평은 삭제될 수 있습니다.

서평추가

별점

*서평 입력 시 선택한 별점은 전체 별점에 반영됩니다.

제목입력

본문입력

태그

Tag List
Tag Cloud

Quick Menu

My Menu