Analyzing baseball data with R
著者
書誌事項
Analyzing baseball data with R
(The R series)
CRC Press, c2014
大学図書館所蔵 全10件
  青森
  岩手
  宮城
  秋田
  山形
  福島
  茨城
  栃木
  群馬
  埼玉
  千葉
  東京
  神奈川
  新潟
  富山
  石川
  福井
  山梨
  長野
  岐阜
  静岡
  愛知
  三重
  滋賀
  京都
  大阪
  兵庫
  奈良
  和歌山
  鳥取
  島根
  岡山
  広島
  山口
  徳島
  香川
  愛媛
  高知
  福岡
  佐賀
  長崎
  熊本
  大分
  宮崎
  鹿児島
  沖縄
  韓国
  中国
  タイ
  イギリス
  ドイツ
  スイス
  フランス
  ベルギー
  オランダ
  スウェーデン
  ノルウェー
  アメリカ
注記
Includes bibliographical references (p. 325-328) and index
内容説明・目次
内容説明
With its flexible capabilities and open-source platform, R has become a major tool for analyzing detailed, high-quality baseball data. Analyzing Baseball Data with R provides an introduction to R for sabermetricians, baseball enthusiasts, and students interested in exploring the rich sources of baseball data. It equips readers with the necessary skills and software tools to perform all of the analysis steps, from gathering the datasets and entering them in a convenient format to visualizing the data via graphs to performing a statistical analysis.
The authors first present an overview of publicly available baseball datasets and a gentle introduction to the type of data structures and exploratory and data management capabilities of R. They also cover the traditional graphics functions in the base package and introduce more sophisticated graphical displays available through the lattice and ggplot2 packages. Much of the book illustrates the use of R through popular sabermetrics topics, including the Pythagorean formula, runs expectancy, career trajectories, simulation of games and seasons, patterns of streaky behavior of players, and fielding measures. Each chapter contains exercises that encourage readers to perform their own analyses using R. All of the datasets and R code used in the text are available online.
This book helps readers answer questions about baseball teams, players, and strategy using large, publically available datasets. It offers detailed instructions on downloading the datasets and putting them into formats that simplify data exploration and analysis. Through the book's various examples, readers will learn about modern sabermetrics and be able to conduct their own baseball analyses.
目次
The Baseball Datasets
Introduction
The Lahman Database: Season-by-Season Data
Retrosheet Game-by-Game Data
Retrosheet Play-by-Play Data
Pitch-by-Pitch Data
Introduction to R
Introduction
Installing R and RStudio
Vectors
Objects and Containers in R
Collection of R Commands
Reading and Writing Data in R
Data Frames
Packages
Splitting, Applying, and Combining Data
Traditional Graphics
Introduction
Factor Variable
Saving Graphs
Dot Plots
Numeric Variable: Stripchart and Histogram
Two Numeric Variables
A Numeric Variable and a Factor Variable
Comparing Ruth, Aaron, Bonds, and A-Rod
The 1998 Home Run Race
The Relation between Runs and Wins
Introduction
The Teams Table in Lahman's Database
Linear Regression
The Pythagorean Formula for Winning Percentage
The Exponent in the Pythagorean Formula
Good and Bad Predictions by the Pythagorean Formula
How Many Runs for a Win?
Value of Plays Using Run Expectancy
The Runs Expectancy Matrix
Runs Scored in the Remainder of the Inning
Creating the Matrix
Measuring Success of a Batting Play
Albert Pujols
Opportunity and Success for All Hitters
Position in the Batting Lineup
Run Values of Different Base Hits
Value of Base Stealing
Advanced Graphics
Introduction
The lattice Package
The ggplot2 Package
Balls and Strikes Effects
Introduction
Hitter's Counts and Pitcher's Counts
Behaviors by Count
Career Trajectories
Introduction
Mickey Mantle's Batting Trajectory
Comparing Trajectories
General Patterns of Peak Ages
Trajectories and Fielding Position
Simulation
Introduction
Simulating a Half Inning
Simulating a Baseball Season
Exploring Streaky Performances
Introduction
The Great Streak
Streaks in Individual At-Bats
Local Patterns of Weighted On-Base Average
Learning about Park Effects by Database Management Tools
Introduction
Installing MySQL and Creating a Database
Connecting R to MySQL
Filling a MySQL Game Log Database from R
Querying Data from R
Baseball Data as MySQL Dumps
Calculating Basic Park Factors
Exploring Fielding Metrics with Contributed R Packages
Introduction
A Motivating Example: Comparing Fielding Metrics
Comparing Two Shortstops
Appendix A: Retrosheet Files Reference
Appendix B: Accessing and Using MLBAM Gameday and PITCHf/x Data
Bibliography
Index
Further Reading and Exercises appear at the end of each chapter.
「Nielsen BookData」 より