Skip to search.

Breaking News Visit Yahoo! News for the latest.

×Close this window

baseball-databank · Baseball Databank

The Yahoo! Groups Product Blog

Check it out!

Group Information

? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Message search is now enhanced, find messages faster. Take it for a spin.

Messages

Advanced
Messages Help
Messages 3925 - 3954 of 4385   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Show Message Summaries Sort by Date ^  
#3925 From: Wells Oliver <wells@...>
Date: Mon Mar 1, 2010 7:57 pm
Subject: Question about Gameday data: pitch types
xmutex
Send Email Send Email
 
Hey everyone. I'm working through some little things and I'm wondering if I am just missing something obvious: what I'd like to do is use the Gameday data to determine a given lineup's skill against pitch types. So I'm looking at the probable starter for a given game, and getting his pitch types like so:

SELECT
pitcher player_id,
count(*) c,
pitch_type
FROM gameday.pitch
WHERE pitcher IN (%s)
AND pitch_type IS NOT NULL
GROUP BY pitch_type
ORDER BY c DESC

This gives you a breakdown of # of pitch type thrown (as per Gameday determinations). However, it seems like Gameday is lacking the info to really figure out how well a batter does against a certain kind of pitch. You can count from the same pitch table where the batter is a certain ID and group by the 'des' column to see on what pitch types the hitter made a hit, but it seems... incomplete.

Any tips here?

--
Wells Oliver
wells@...

#3926 From: Wells Oliver <wells@...>
Date: Thu Mar 4, 2010 8:14 pm
Subject: Re: Gameday data gone?
xmutex
Send Email Send Email
 
ARGH, nevermind- was using 'content' and not 'component'. Apologies to one and all.

On Thu, Mar 4, 2010 at 1:15 PM, Wells Oliver <wells@...> wrote:
Anyone know if MLB.com is still going to provide the standard XML data for Gameday this year? I know it's just spring training, and so no pitchfx data, but even the standard XML files containing play by play, etc, don't appear there:

http://gd2.mlb.com/content/game/mlb/year_2010/month_03/

--
Wells Oliver
wells@...



--
Wells Oliver
wells@...

#3927 From: Sean Forman <sean-forman@...>
Date: Thu Mar 4, 2010 8:28 pm
Subject: Re: Re: Gameday data gone?
sforman71
Send Email Send Email
 


sean
---
Sean Forman
Sports Reference LLC, President
http://www.sports-reference.com/


On Thu, Mar 4, 2010 at 3:14 PM, Wells Oliver <wells@...> wrote:
 

ARGH, nevermind- was using 'content' and not 'component'. Apologies to one and all.

On Thu, Mar 4, 2010 at 1:15 PM, Wells Oliver <wells@...> wrote:
Anyone know if MLB.com is still going to provide the standard XML data for Gameday this year? I know it's just spring training, and so no pitchfx data, but even the standard XML files containing play by play, etc, don't appear there:

http://gd2.mlb.com/content/game/mlb/year_2010/month_03/

--
Wells Oliver
wells@...



--
Wells Oliver
wells@...



#3928 From: Wells Oliver <wells@...>
Date: Thu Mar 4, 2010 7:15 pm
Subject: Gameday data gone?
xmutex
Send Email Send Email
 
Anyone know if MLB.com is still going to provide the standard XML data for Gameday this year? I know it's just spring training, and so no pitchfx data, but even the standard XML files containing play by play, etc, don't appear there:

http://gd2.mlb.com/content/game/mlb/year_2010/month_03/

--
Wells Oliver
wells@...

#3929 From: Paul Golba <pgolba2@...>
Date: Thu Mar 11, 2010 3:20 am
Subject: Errata in the 1998 NL
pgolba2
Send Email Send Email
 
First email as part of the group.  Apologies if I am violating any protocols.

I've been trying to use the Baseball Databank (BDB) database to compute Win
Shares.  I've been following the book's step-by-step example with the 1998 St.
Louis Cardinals and I keep running into small discrepancies between the book and
the BDB.  Most of the time the book matches Baseball-Reference (B-R) and/or
Retrosheet.

Issues that I have encountered so far (all involved the 1998 NL):
1. Roberto Petagine (CIN) has 2 GIDP in BDB.  B-R has 1.  Retrosheet has him
hitting into two double plays, but one was a line drive type.
2. There are three pitcher HBP in the NL missing.  According to B-R John Thomson
(COL) has 2 and Gabe Gonzalez (FLO) has 1.  BDB has 0 for both.
3. The league fielding putouts (69710) to not match the total of the league
IPOuts (69719).
4. The league IPOuts (69719) do not match B-R (69720).  I haven't tracked down
where that stray out is yet.
5. For the entire NL, BDB has 837 WP, B-R has 835.  Again, I didn't track down
particular players yet.
6. The pitching runs allowed (not earned runs, just plain old runs) is off
significantly.  BDB has 11918, B-R 11943.  I did track down a couple of players
with Florida that have discrepancies.  Antonio Alfoseca has has 32 in BDB
(matching his ER) but 36 in B-R.  Felix Heredia has 25 in BDB (also matching his
ER) but 30 in B-R.  There are more discrepancies than those two though.

There are also issues with fielding assists, but that does not concern me as
much as the other stuff.

Is this the right place to report these issues?  I am willing to help track down
discrepancies, upon request.

Paul Golba

#3930 From: "Tangotiger" <tom@...>
Date: Thu Mar 11, 2010 2:18 pm
Subject: Re: Errata in the 1998 NL
tom@...
Send Email Send Email
 
> Is this the right place to report these issues?  I am willing to help
> track down discrepancies, upon request.
>
> Paul Golba
>

This is absolutely the right place.  And while you didn't break any
protocols (if we even have any), from where I sit you can break whatever
protocols you need to, to report data issues like this.

Truth > whatever

Tom

#3931 From: Wells Oliver <wells@...>
Date: Thu Mar 18, 2010 4:05 pm
Subject: Useful list of players?
xmutex
Send Email Send Email
 
So a quick question here- I'm working on my little projection system and I'm doing a lot of hacky crap to put together the "list of MLB players" to project. Right now I'm using the entire 40 man rosters of every team as provided by mlb.com. I have to use Tango's list of IDs - which is awesome, but still under development - to match each MLB player up with their BDB ID, and excluding any guy w/o a BDB ID brings the rosters down by 10ish.

I need the BDB ID as I'm using the BDB's stats for projections.

Is there some cleaner way of doing this? Some better "list of players" from which to work?

Any tips appreciated. Thanks!

--
Wells Oliver
wells@...

#3932 From: "Tangotiger" <tom@...>
Date: Thu Mar 18, 2010 6:14 pm
Subject: Re: Useful list of players?
tom@...
Send Email Send Email
 
If you are ONLY going to use players with a BDB_ID, why don't you do:

select player_id from batting where year_id between 2007 and 2009;

Tom


> So a quick question here- I'm working on my little projection system and
> I'm
> doing a lot of hacky crap to put together the "list of MLB players" to
> project. Right now I'm using the entire 40 man rosters of every team as
> provided by mlb.com. I have to use Tango's list of IDs - which is awesome,
> but still under development - to match each MLB player up with their BDB
> ID,
> and excluding any guy w/o a BDB ID brings the rosters down by 10ish.
>
> I need the BDB ID as I'm using the BDB's stats for projections.
>
> Is there some cleaner way of doing this? Some better "list of players"
> from
> which to work?
>
> Any tips appreciated. Thanks!
>
> --
> Wells Oliver
> wells@...
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#3933 From: "Tangotiger" <tom@...>
Date: Thu Mar 18, 2010 6:17 pm
Subject: Re: Useful list of players?
tom@...
Send Email Send Email
 
select distinct player_id ...

Tom

#3934 From: "kgold11" <kgoldpc@...>
Date: Sat Mar 20, 2010 2:29 am
Subject: link team to retroID
kgold11
Send Email Send Email
 
If I have a file with retroID and player name, are there files out there that I
can use to link those names with a team?  I would prefer to use Excel if
possible.

#3935 From: SABRMnLgs@...
Date: Sat Mar 20, 2010 12:39 am
Subject: Re: link team to retroID
sabrmnlgs
Send Email Send Email
 
you need to contact Sean-Foreman@...
 
In a message dated 3/19/2010 10:42:20 P.M. Eastern Daylight Time, kgoldpc@... writes:
 

If I have a file with retroID and player name, are there files out there that I can use to link those names with a team? I would prefer to use Excel if possible.


#3936 From: "Tangotiger" <tom@...>
Date: Sun Mar 21, 2010 3:08 pm
Subject: Re: link team to retroID
tom@...
Send Email Send Email
 
Do I understand you correctly that you want to see:

playerID,teamID

So that you will see the team for every player the team the player played on?

In that case:
select a.playerID as bdbid, retroid,teamID,min(yearid) as
minYear,max(yearid) as maxyear

from appearances a inner join master m
on a.playerid = m.playerid

group by a.playerID,retroid,teamID
;

Tom

> you need to contact _Sean-Foreman@..._
> (mailto:Sean-Foreman@...)
>
>
> In a message dated 3/19/2010 10:42:20 P.M. Eastern Daylight Time,
> kgoldpc@... writes:
>
>
>
>
> If I have a file with retroID and player name, are there files out there
> that I can use to link those names with a team? I would prefer to use
> Excel
> if  possible.
>
>
>
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#3937 From: "kgold11" <kgoldpc@...>
Date: Mon Mar 22, 2010 1:53 am
Subject: Re: link team to retroID
kgold11
Send Email Send Email
 
Thanks Tango. This worked perfectly.

--- In baseball-databank@yahoogroups.com, "Tangotiger" <tom@...> wrote:
>
> Do I understand you correctly that you want to see:
>
> playerID,teamID
>
> So that you will see the team for every player the team the player played on?
>
> In that case:
> select a.playerID as bdbid, retroid,teamID,min(yearid) as
> minYear,max(yearid) as maxyear
>
> from appearances a inner join master m
> on a.playerid = m.playerid
>
> group by a.playerID,retroid,teamID
> ;
>
> Tom
>
> > you need to contact _Sean-Foreman@..._
> > (mailto:Sean-Foreman@...)
> >
> >
> > In a message dated 3/19/2010 10:42:20 P.M. Eastern Daylight Time,
> > kgoldpc@... writes:
> >
> >
> >
> >
> > If I have a file with retroID and player name, are there files out there
> > that I can use to link those names with a team? I would prefer to use
> > Excel
> > if  possible.
> >
> >
> >
> >
>
>
> ---------------------------------------------
> The Book--Playing The Percentages In Baseball
> http://www.InsideTheBook.com
>

#3938 From: Wells Oliver <wells@...>
Date: Thu Mar 25, 2010 6:35 pm
Subject: Verifying process / math on HR/9 rates between leagues
xmutex
Send Email Send Email
 
Trying to figure out league adjustments for 2009- doing HR per 9 right now. Querying like so:

SELECT
SUM(HR),
SUM(HR) / SUM(IPouts / 3) * 9,
lgID
FROM Pitching
WHERE yearID = 2009
GROUP BY lgID;

Yields 1.1127 for the AL and 0.9928 for the NL.

Therefore the difference in rate would be a 0.1 advantage to the AL.

Seems kinda odd. Am I going about this the right way? Thanks for any tips.

--
Wells Oliver
wells@...

#3939 From: "Tangotiger" <tom@...>
Date: Fri Mar 26, 2010 9:01 pm
Subject: Re: Verifying process / math on HR/9 rates between leagues
tom@...
Send Email Send Email
 
Your method of multiplying the AL HR rate by 89% to get it to the same
mean as the NL rate implies that the average AL pitcher = average NL
pitcher.

This is not necessarily true, and almost certainly not true.

Tom

> Trying to figure out league adjustments for 2009- doing HR per 9 right
> now.
> Querying like so:
>
> SELECT
> SUM(HR),
> SUM(HR) / SUM(IPouts / 3) * 9,
> lgID
> FROM Pitching
> WHERE yearID = 2009
> GROUP BY lgID;
>
> Yields 1.1127 for the AL and 0.9928 for the NL.
>
> Therefore the difference in rate would be a 0.1 advantage to the AL.
>
> Seems kinda odd. Am I going about this the right way? Thanks for any tips.
>
> --
> Wells Oliver
> wells@...
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#3940 From: Wells Oliver <wells@...>
Date: Fri Mar 26, 2010 9:05 pm
Subject: Gameday strike zone question
xmutex
Send Email Send Email
 
Which of the lovely attributes in the <pitch> elements contains where the pitch was in the strike zone? (the numbers you use to make a heat map)

http://gd2.mlb.com/components/game/mlb/year_2009/month_07/day_05/gid_2009_07_05_balmlb_anamlb_1/inning/inning_4.xml

--
Wells Oliver
wells@...

#3941 From: Clay Dreslough <cjd@...>
Date: Sun Mar 28, 2010 2:03 pm
Subject: Re: Gameday strike zone question
dreslough
Send Email Send Email
 
I don't know the answer to this, but looking at the data I would guess this is
what they are doing:

These two coordinates show where the ball crossed the plate:

px="0.397"
pz="2.207"

px is the ball's "inside/outside" position. 0.00 is the middle of the plate.
negative is (probably) inside on a
right-handed hitter. positive is outside

pz is the ball's height off the ground. 0.00 is the ground itself.

Each pitch also lists the top and bottom of the strike zone, which varies by
batter:

sz_top="3.72"
sz_bot="1.72"

Wells Oliver wrote:
>
>
> Which of the lovely attributes in the <pitch> elements contains where
> the pitch was in the strike zone? (the numbers you use to make a heat map)
>
>
http://gd2.mlb.com/components/game/mlb/year_2009/month_07/day_05/gid_2009_07_05_\
balmlb_anamlb_1/inning/inning_4.xml
>
<http://gd2.mlb.com/components/game/mlb/year_2009/month_07/day_05/gid_2009_07_05\
_balmlb_anamlb_1/inning/inning_4.xml>
>
> --
> Wells Oliver
> wells@... <mailto:wells@...>

--
Clay Dreslough
President, Sports Mogul Inc.
cjd@...
www.sportsmogul.com
Baseball Mogul 2011, now available:
http://www.sportsmogul.com/games/baseball2k11.html

#3942 From: Wells Oliver <wells@...>
Date: Mon Mar 29, 2010 4:53 pm
Subject: Question re: splits on baseball-reference.com
xmutex
Send Email Send Email
 
Any idea what would account for the difference in career split totals on yahoo.com and baseball-reference.com?

Ichiro on Yahoo: http://sports.yahoo.com/mlb/players/6615/splits;_ylt=AlNqRTkpU1Aas_ifYQm778GFCLcF?year=career&type=Batting

Ichiro on b-ref: http://www.baseball-reference.com/players/split.cgi?n1=suzukic01&year=Career&t=b

Just the nature of using different data sources? Post-season play?

--
Wells Oliver
wells@...

#3943 From: Sean Forman <sean-forman@...>
Date: Mon Mar 29, 2010 6:01 pm
Subject: Re: Question re: splits on baseball-reference.com
sforman71
Send Email Send Email
 
Wells,

You need to be a lot more specific.  What are the differences you are seeing?

sean
---
Sean Forman
Sports Reference LLC, President
http://www.sports-reference.com/


On Mon, Mar 29, 2010 at 12:53 PM, Wells Oliver <wells@...> wrote:
 

Any idea what would account for the difference in career split totals on yahoo.com and baseball-reference.com?

Ichiro on Yahoo: http://sports.yahoo.com/mlb/players/6615/splits;_ylt=AlNqRTkpU1Aas_ifYQm778GFCLcF?year=career&type=Batting

Ichiro on b-ref: http://www.baseball-reference.com/players/split.cgi?n1=suzukic01&year=Career&t=b

Just the nature of using different data sources? Post-season play?

--
Wells Oliver
wells@...



#3944 From: "Tangotiger" <tom@...>
Date: Mon Mar 29, 2010 6:11 pm
Subject: Re: Question re: splits on baseball-reference.com
tom@...
Send Email Send Email
 
Also, look at Retrosheet as the "tie-breaker".

Tom

> Any idea what would account for the difference in career split totals on
> yahoo.com and baseball-reference.com?
>

#3945 From: Anthony Giacalone <giacalone.anthony@...>
Date: Mon Mar 29, 2010 6:38 pm
Subject: Re: Question re: splits on baseball-reference.com
anthonygiaca...
Send Email Send Email
 
Sean, 

I noticed that bb-ref normalized career splits are messed for pitchers at least.  They are off by a column and they don't seem to equal the right number of plate appearances.

Anthony Giacalone

Sent from my iPhone

On Mar 29, 2010, at 1:01 PM, Sean Forman <sean-forman@...> wrote:

 

Wells,


You need to be a lot more specific.  What are the differences you are seeing?

sean
---
Sean Forman
Sports Reference LLC, President
http://www.sports-reference.com/


On Mon, Mar 29, 2010 at 12:53 PM, Wells Oliver <wells@submute.net> wrote:
 

Any idea what would account for the difference in career split totals on yahoo.com and baseball-reference.com?

Ichiro on Yahoo: http://sports.yahoo.com/mlb/players/6615/splits;_ylt=AlNqRTkpU1Aas_ifYQm778GFCLcF?year=career&type=Batting

Ichiro on b-ref: http://www.baseball-reference.com/players/split.cgi?n1=suzukic01&year=Career&t=b

Just the nature of using different data sources? Post-season play?

--
Wells Oliver
wells@submute.net



#3946 From: Wells Oliver <wells@...>
Date: Mon Mar 29, 2010 7:06 pm
Subject: Re: Question re: splits on baseball-reference.com
xmutex
Send Email Send Email
 
Sorry, was kind of a pre-caffeine question. The importat numbers (AB, H, etc) are the same between b-ref and Yahoo for Ichiro's splits are the same. The only number that differs is Games (G) and I guess I don't really care too much about that :)

- WElls

On Mon, Mar 29, 2010 at 1:11 PM, Tangotiger <tom@...> wrote:
 

Also, look at Retrosheet as the "tie-breaker".

Tom



> Any idea what would account for the difference in career split totals on
> yahoo.com and baseball-reference.com?
>




--
Wells Oliver
wells@...

#3947 From: Sean Forman <sean-forman@...>
Date: Mon Mar 29, 2010 7:12 pm
Subject: Re: Question re: splits on baseball-reference.com
sforman71
Send Email Send Email
 
Anthony,

I think that has been fixed.  Let me know if that is not the case.

sean
---
Sean Forman
Sports Reference LLC, President
http://www.sports-reference.com/


On Mon, Mar 29, 2010 at 2:38 PM, Anthony Giacalone <giacalone.anthony@...> wrote:
 

Sean, 

I noticed that bb-ref normalized career splits are messed for pitchers at least.  They are off by a column and they don't seem to equal the right number of plate appearances.

Anthony Giacalone

Sent from my iPhone

On Mar 29, 2010, at 1:01 PM, Sean Forman <sean-forman@...> wrote:

 

Wells,


You need to be a lot more specific.  What are the differences you are seeing?

sean
---
Sean Forman
Sports Reference LLC, President
http://www.sports-reference.com/


On Mon, Mar 29, 2010 at 12:53 PM, Wells Oliver <wells@...> wrote:
 

Any idea what would account for the difference in career split totals on yahoo.com and baseball-reference.com?

Ichiro on Yahoo: http://sports.yahoo.com/mlb/players/6615/splits;_ylt=AlNqRTkpU1Aas_ifYQm778GFCLcF?year=career&type=Batting

Ichiro on b-ref: http://www.baseball-reference.com/players/split.cgi?n1=suzukic01&year=Career&t=b

Just the nature of using different data sources? Post-season play?

--
Wells Oliver
wells@...




#3948 From: Sean Forman <sean-forman@...>
Date: Mon Mar 29, 2010 7:16 pm
Subject: Re: Question re: splits on baseball-reference.com
sforman71
Send Email Send Email
 


On Mon, Mar 29, 2010 at 3:06 PM, Wells Oliver <wells@...> wrote:
 

Sorry, was kind of a pre-caffeine question. The importat numbers (AB, H, etc) are the same between b-ref and Yahoo for Ichiro's splits are the same. The only number that differs is Games (G) and I guess I don't really care too much about that :)

- WElls




My guess is that for some of them they may be counting games where any pitcher was a RH or LH rather than just games he faced one.  They may also be counting baserunning in some of the others.  I tried to be careful that my splits are just show a game played when the batter batted in that situation.  Just a guess.  Not 100% sure.

sean

#3949 From: "wydiyd" <wydiyd@...>
Date: Tue Mar 30, 2010 3:01 pm
Subject: Primary Position Dataset
wydiyd
Send Email Send Email
 
For 2008, I was able to get a player's primary position sheet.  Does anyone know
if someone has done one for the 2009 season?  thanks.

#3950 From: Wells Oliver <wells@...>
Date: Tue Mar 30, 2010 3:32 pm
Subject: Re: Primary Position Dataset
xmutex
Send Email Send Email
 
Colin Wyers has a SQL query here. You can use it to create a view or table:

http://basql.wikidot.com/primary-position

(I think that site belongs to Colin Wyers)

On Tue, Mar 30, 2010 at 10:01 AM, wydiyd <wydiyd@...> wrote:
 

For 2008, I was able to get a player's primary position sheet. Does anyone know if someone has done one for the 2009 season? thanks.




--
Wells Oliver
wells@...

#3951 From: "Tangotiger" <tom@...>
Date: Tue Mar 30, 2010 3:33 pm
Subject: Re: Primary Position Dataset
tom@...
Send Email Send Email
 
I posted an Access DB that calculates the prim pos for a player for any
season (or career.. can't remember).  Look in the files section somewhere
for a file called PrimPos.mdb or such.

Tom

> For 2008, I was able to get a player's primary position sheet.  Does
> anyone know if someone has done one for the 2009 season?  thanks.
>
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#3952 From: "David" <terpsfan101@...>
Date: Wed Mar 31, 2010 10:14 pm
Subject: Re: Primary Position Dataset
terpsfan101
Send Email Send Email
 
My modified version of the Lahman/BDB databse has primary positions for all
players listed in the batting table:

http://www.mediafire.com/?omc4yizmugz

The method I use to determine primary position is the same as Tango's, except I
used defensive innings (actual and estimated prior to 1952) instead of games.

#3953 From: Paul Golba <pgolba2@...>
Date: Sat Apr 3, 2010 8:37 pm
Subject: RetroSheet ID Missing from Master Table
pgolba2
Send Email Send Email
 
There are 205 records in the Data-Bank Master table that have a PlayerID but no
retroID.  They are also missing the holtzID, the lahman40ID, and the lahman45ID,
but I do not know if any of those are relevant (I suspect the lahman IDs are for
obsolete versions of that DB, no idea about holtzID).  The bbrefID is populated
for all of them.

Of the 205 records, 204 are for players that debuted in 2009.  In fact ALL of
the players that debuted in 2009 are missing that information.  The other
playerID is hemonro99 for a Roland Hemond, who does not appear in the Batting,
Pitching, or Fielding tables and and appears to be a baseball executive with no
playing experience.

Is this going to be fixed?  Or did I just volunteer?

Paul Golba

#3954 From: Wells Oliver <wells@...>
Date: Sat Apr 3, 2010 12:00 am
Subject: List of players (with their MLBAM ID) by year from MLB.com?
xmutex
Send Email Send Email
 
Just wondering if anyone's come across a way of retrieving/viewing all of the MLB players for a given year w/ their MLBAM ID from MLB.com.

Thanks.

--
Wells Oliver
wells@...

Messages 3925 - 3954 of 4385   Oldest  |  < Older  |  Newer >  |  Newest
Add to My Yahoo!      XML What's This?

Copyright © 2010 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines NEW - Help