mlb_teams | R Documentation |
Major League Baseball Teams Data.
Description
A subset of data on Major League Baseball teams from Lahman's Baseball Database. The full dataset is available in the Lahman R package.
Usage
mlb_teams
Format
A data frame with 2784 rows and 41 variables.
- year
Year of play.
- league_id
League the team plays in, with levels AL (American League) and NL (National League).
- division_id
Division the team plays in, with levels W (west), E (east) and C (central).
- rank
Team's rank in their division at the end of the regular season.
- games_played
Games played.
- home_games
Games played at home.
- wins
Number of games won.
- losses
Number of games lost.
- division_winner
Did the team win their division? Levels of Y (yes) and N (no).
- wild_card_winner
Was the team a wild card winner? Levels of Y (yes) and N (no).
- league_winner
Did the team win their league? Levels of Y (yes) and N (no).
- world_series_winner
Did the team win the World Series? Levels of Y (yes) and N (no).
- runs_scored
Number of runs scored during the season.
- at_bats
Number of at bats during the season.
- hits
Number of hits during the season. Includes singles, doubles, triples and home runs.
- doubles
Number of doubles hit.
- triples
Number of triples hit.
- homeruns
Number of home runs hit by batters.
- walks
Number of walks.
- strikeouts_by_batters
Number of batters who struck out.
- stolen_bases
Number of stolen bases.
- caught_stealing
Number of base runners caught stealing.
- batters_hit_by_pitch
Number of batters hit by a pitch.
- sacrifice_flies
Number of sacrifice flies.
- opponents_runs_scored
Number of runs scored by opponents.
- earned_runs_allowed
Number of earned runs allowed.
- earned_run_average
Earned run average.
- complete_games
Number of games where a single pitcher played the entire game.
- shutouts
Number of shutouts.
- saves
Number of saves.
- outs_pitches
Number of outs pitched for the season (number of innings pitched times 3).
- hits_allowed
Number of hits made by opponents.
- homeruns_allowed
Number of home runs hit by opponents.
- walks_allowed
Number of opponents who were walked.
- strikeouts_by_pitchers
Number of opponents who were struck out.
- errors
Number of errors.
- double_plays
Number of double plays.
- fielding_percentage
Team's fielding percentage.
- team_name
Full name of team.
- ball_park
Home ballpark name.
- home_attendance
Home attendance total.
Source
Lahman's Baseball Database
Examples
library(dplyr)

# List the World Series winning teams for each year
mlb_teams |>
  filter(world_series_winner == "Y") |>
  select(year, team_name, ball_park)

# List the teams with their average number of wins and losses
mlb_teams |>
  group_by(team_name) |>
  summarize(mean_wins = mean(wins), mean_losses = mean(losses)) |>
  arrange(team_name)
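
Several of the column definitions in the Format section imply simple derived quantities: hits includes singles, doubles, triples and home runs, and outs_pitches is innings pitched times 3. The sketch below shows how those relationships might be used. It is written in base R so it runs without extra packages, and it uses a small hypothetical data frame (toy_teams, with invented values) rather than the real mlb_teams data. The names singles, innings_pitched and era_check are illustrative, not columns of the dataset, and the last step assumes earned_run_average follows the conventional definition of earned runs per nine innings.

```r
# Hypothetical toy rows that mimic a few mlb_teams columns. The values are
# invented for illustration and are not taken from the real dataset.
toy_teams <- data.frame(
  team_name = c("Team A", "Team B"),
  hits = c(1400, 1350),
  doubles = c(280, 260),
  triples = c(30, 25),
  homeruns = c(200, 180),
  outs_pitches = c(4350, 4320),
  earned_runs_allowed = c(650, 700)
)

# hits counts singles, doubles, triples and home runs,
# so singles are whatever remains after removing the other three.
toy_teams$singles <- with(toy_teams, hits - doubles - triples - homeruns)

# outs_pitches is innings pitched times 3, so dividing by 3 recovers innings.
toy_teams$innings_pitched <- toy_teams$outs_pitches / 3

# Assumption: the conventional ERA formula, earned runs per nine innings.
toy_teams$era_check <- with(toy_teams, 9 * earned_runs_allowed / innings_pitched)

toy_teams[, c("team_name", "singles", "innings_pitched", "era_check")]
```

Replacing toy_teams with mlb_teams should apply the same steps to the real data, assuming the dataset is loaded in the session. Comparing era_check against earned_run_average would be one way to see whether that column follows the conventional formula.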