Skip to search.

Breaking News Visit Yahoo! News for the latest.

×Close this window

baseball-databank · Baseball Databank

The Yahoo! Groups Product Blog

Check it out!

Group Information

? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Real people. Real stories. See how Yahoo! Groups impacts members worldwide.

Messages

Advanced
Messages Help
Messages 4251 - 4280 of 4385   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Show Message Summaries Sort by Date ^  
#4251 From: Paul Golba <pgolba2@...>
Date: Sat Mar 24, 2012 8:34 pm
Subject: Re: Re: Stints
pgolba2
Send Email Send Email
 
My vote is to keep the stints as is. 

From a database perspective, it is much, much easier to take a stint divided table and sum it up to get the overall numbers than it is to try to take a combined table and then split it back up using a separate stint table.  I suspect it would be harder for the administrator to maintain two separate tables.

From a baseball perspective, the stint field in valuable to determine how a player moved from team to team during a season.  This is pretty basic information.  Does everyone need this information?  No.  Is it useful for people who do need this information?  Yes.

Also, along with the team discrepancies on the stints that I noted to start this (unexpected) thread, there are also several hundred pitching stints in the years 2009-2011 that did not have a Batting record at all.  Almost all of them are in the AL and I suspect none of the pitchers involved ever batted.  This is not a huge deal, except that for all other seasons if a playerID had a Pitching record he always had a Batting record, even if he never batted.  You may already be aware of it at this point, but I mention it anyway.

Paul Golba


From: anson2995 <slahman@...>
To: baseball-databank@yahoogroups.com
Sent: Tuesday, March 20, 2012 9:39 AM
Subject: [baseball-databank] Re: Stints

 
"Tangotiger" <tom@...> wrote:
> I'm asking if the cost of that use is a justifiable cost.
> We had bad data in the first release, and we have bad data
> according to the recent post, and it's traced to the stint ID.

The problem of bad data don't have anything to do with the database design. It's 100% attributable to me, the person who processed most of the updates. It's the first time in several years that I made the offseason updates rather than Sean Forman, and the scripts I used to make and check the updates were outdated. It shouldn't be a problem in the future.

I think it's much more labor intensive to use and maintain a table that lists start and end dates in a separate transaction file, especially if we continue to maintain batting/pitching/fielding as separate files.

But I'm certainly open to further discussion, on this or other design issues.

Regards,
Sean Lahman




#4252 From: "Tangotiger" <tom@...>
Date: Sat Mar 24, 2012 11:28 pm
Subject: Re: Re: Stints
tom@...
Send Email Send Email
 
Paul,

Your post is clear why we do *not* want to keep the stints as-is.

The requirement about chrono-stints can already be addressed by a Stints
table that shows the stint order.  Several posters have already responded
positively to this.

The "batting" table's dual role has already caused problems in the past.
I think the official MLB position is that the "batting" table is the "all
appearances" table, so that any game gets recorded with a "batting"
record, even if he didn't bat.  (I'm not exactly sure about this, but I'm
going on memory here, but it's consistent with players not batting still
having a batting record.)

Anyway, from a database perspective, we don't need to have a stint denoted
in the batting *and* pitching *and* fielding tables, and ensure it
matches.  The key fields of tables are supposed to identify records in not
such a rigid way that you would have to alter the key field if you find
the data needs to be updated.  That's why playerID fields should never
change, even if a pitcher's name gets changed.  You don't want to have the
stint as a key field, if it means that it may change if we have new
information.  Imagine we introduce minor league data.  Now, you've got
MASSIVE changes in key fields for tons of players across multiple tables.
(Think of players like JJ Hardy.)

At least, with a Stints table, it will be localized to a single table,
whose entire purpose is to track that.  Indeed, you wouldn't even need to
have the stintID have to be a key field.

Had we started with a clean slate, the Stints would be treated
equivalently to Home/Away splits or Inning splits or Starter/Relief
splits.  They'd be part of a child table.

Tom



> My vote is to keep the stints as is. 
>
>
> From a database perspective, it is much, much easier to take a stint
> divided table and sum it up to get the overall numbers than it is to try
> to take a combined table and then split it back up using a separate stint
> table.  I suspect it would be harder for the administrator to maintain
> two separate tables.
>
> From a baseball perspective, the stint field in valuable to determine how
> a player moved from team to team during a season.  This is pretty basic
> information.  Does everyone need this information?  No.  Is it useful
> for people who do need this information?  Yes.
>
> Also, along with the team discrepancies on the stints that I noted to
> start this (unexpected) thread, there are also several hundred pitching
> stints in the years 2009-2011 that did not have a Batting record at all. 
> Almost all of them are in the AL and I suspect none of the pitchers
> involved ever batted.  This is not a huge deal, except that for all other
> seasons if a playerID had a Pitching record he always had a Batting
> record, even if he never batted.  You may already be aware of it at this
> point, but I mention it anyway.
>
> Paul Golba
>
>
>
> ________________________________
>  From: anson2995 <slahman@...>
> To: baseball-databank@yahoogroups.com
> Sent: Tuesday, March 20, 2012 9:39 AM
> Subject: [baseball-databank] Re: Stints
>
>
>  
> "Tangotiger" <tom@...> wrote:
>> I'm asking if the cost of that use is a justifiable cost.
>> We had bad data in the first release, and we have bad data
>> according to the recent post, and it's traced to the stint ID.
>
> The problem of bad data don't have anything to do with the database
> design. It's 100% attributable to me, the person who processed most of the
> updates. It's the first time in several years that I made the offseason
> updates rather than Sean Forman, and the scripts I used to make and check
> the updates were outdated. It shouldn't be a problem in the future.
>
> I think it's much more labor intensive to use and maintain a table that
> lists start and end dates in a separate transaction file, especially if we
> continue to maintain batting/pitching/fielding as separate files.
>
> But I'm certainly open to further discussion, on this or other design
> issues.
>
> Regards,
> Sean Lahman
>
>
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#4253 From: "chrislambrou" <chrislambrou@...>
Date: Mon Mar 26, 2012 3:37 pm
Subject: Re: Stints
chrislambrou
Send Email Send Email
 
Does anyone have a comment on the reply below?

I'm always running into problems with appearances and have to use the fielding
table.  Not the best table to JOIN with since almost every player has multiple
records per season.

Thanks,
-Chris

--- In baseball-databank@yahoogroups.com, "Tangotiger" <tom@...> wrote:
>
The "batting" table's dual role has already caused problems in the past.
I think the official MLB position is that the "batting" table is the "all
appearances" table, so that any game gets recorded with a "batting"
record, even if he didn't bat. (I'm not exactly sure about this, but I'm
going on memory here, but it's consistent with players not batting still
having a batting record.)

#4254 From: "Tangotiger" <tom@...>
Date: Mon Mar 26, 2012 6:47 pm
Subject: Re: Re: Stints
tom@...
Send Email Send Email
 
There's an APPEARANCES table in the Lahman DB.  I haven't verified it, but
that might help you.

Tom

> Does anyone have a comment on the reply below?
>
> I'm always running into problems with appearances and have to use the
> fielding table.  Not the best table to JOIN with since almost every player
> has multiple records per season.
>
> Thanks,
> -Chris
>
> --- In baseball-databank@yahoogroups.com, "Tangotiger" <tom@...> wrote:
>>
> The "batting" table's dual role has already caused problems in the past.
> I think the official MLB position is that the "batting" table is the "all
> appearances" table, so that any game gets recorded with a "batting"
> record, even if he didn't bat. (I'm not exactly sure about this, but I'm
> going on memory here, but it's consistent with players not batting still
> having a batting record.)
>
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#4255 From: Nicholas Miceli <micelin01@...>
Date: Fri Apr 6, 2012 7:20 pm
Subject: Thank you to all.
nsmiceli
Send Email Send Email
 
Dear Members,

Thank you to all who responded to my request for help for readers.

As I get into the planning process for research, I'm sure that list members will have a great deal of constructive things to say.

I hope everyone has a great weekend.

Regards,

Nick Miceli

#4256 From: Clay Dreslough <cjd@...>
Date: Fri Apr 6, 2012 10:33 pm
Subject: Stints
upa2112
Send Email Send Email
 
I don't fully understand what is being proposed, but I just wanted to
take a moment to speak out in favor of backwards compatibility.

For example, people bought "Puresim 4" ( a baseball simulation game by
Shaun Sullivan). It loads data in the "Lahman" format. Even though the
game was published in 2010, it can still load last year's database and
this year's database. If the stint column goes away, many users will be
unable to load next year's database.

If it's too difficult to ensure that the stint data is correct, I'd
rather see it 90% correct, but with documentation that (for example)
stint data after 2010 may not be 100% accurate.

Clay

#4257 From: "Clem Comly" <ccomly@...>
Date: Sat Apr 7, 2012 5:06 am
Subject: Re: Stints
ccomly2003
Send Email Send Email
 
I don't currently have access to a good relational database.  If I did, the solution for stints in Fielding.csv is easy.  For all 2011 fielding stints for pitchers, update the stints to match stint in Pitching.csv for same player, year and team.  For all 2011 fielding stints for non-pitchers, update the stints to match stint in Batting.csv for same player, year and team. 
 
This will work for future years except when a player has 2 separate stints with the same team in the same year.  The fielding stints for those rare players will have to be handled manually.
 
Clem Comly

#4258 From: "Tangotiger" <tom@...>
Date: Sun Apr 8, 2012 3:05 pm
Subject: Re: Stints
tom@...
Send Email Send Email
 
BDB or Lahman DB would be "backward compatible" with the proposed change,
or at least have that as a potential.

If we do this with a STINTS table that shows it the way we've been talking
about it, then you could join the Batting, Pitching, Fielding table to the
STINTS table, and get the stintID for virtually every record.  There's no
reason that it has to be "backward compatible" out of the box.  But, one
extra join on each table will make it backward compatible.

The only issue will be the Rob Ducey's of the world, who leave and come
back to the same team in the same year, while interrupted by a different
MLB team.  Again, that can also be handled with a StintSplits table,
similar to have a splits table for Batting v Pitch Hand if we wanted to
eventually go there.

Anyway, this is really in Lahman's hands, as the keeper of the DB.  He
says that the data quality issues was that he needed to change his script.
  If that's the case, that we won't experience data quality issue again as
a result of stints, then, fine, the issue is mostly moot.  Those of us who
have no need to the stintID field can do the appropriate matching and
summing after-the-fact.

I will say that once you incorporate minor league data, we're going to
revisit this all over again.  As it currently stands, we're going to have
massive key changes when that happens.  JJ Hardy getting sent down to the
minors and being called up will have stintID 1 and 3 in 2009, whereas
right now, he has only 1 record.  And it'll get worse for guys that get
sent down and called up multiple times in the same year.  But, we're not
there yet.

Tom

#4259 From: "KJOK" <kjokbaseball@...>
Date: Tue Apr 10, 2012 8:06 pm
Subject: Re: 2012 Marlins teamID
kjokbaseball
Send Email Send Email
 
Retrosheet changed their minds and will be using TeamID MIA - I think we should
do the same...

THANKS,
Kevin

--- In baseball-databank@yahoogroups.com, Matthew Gargano <mgargano@...> wrote:
>
> Using standard database protocol, franchise ID should *never* change.
> Regardless of where the team moves.
>
> On Thu, Mar 1, 2012 at 9:53 AM, Sean Lahman <sl@...> wrote:
>
> >
> >
> > My plan is to use "MIA" as the teamID for the Miami Marlins. Don't think
> > BBRef or Retrosheet have weighed in yet.
> >
> > As far as the FranchiseID, my inclination is to leave it as is for now.
> >
> > Regards,
> > Sean
> >
> > ---
> > Sean Lahman
> > http://seanlahman.com
> >
> >
> >
> >
> >
> >
>

#4260 From: "chrislambrou" <chrislambrou@...>
Date: Tue Apr 17, 2012 1:34 pm
Subject: Re: Beta version of database available
chrislambrou
Send Email Send Email
 
I didn't see a response on this... Will DH no longer be included in the fielding
table?  Thanks.

-Chris

--- In baseball-databank@yahoogroups.com, John Rickert <rickert@...> wrote:
>
> In the Fielding.csv file previous seasons have totals for games played at DH,
but 2011 does not.
>

#4261 From: "N. S. Miceli, Ph.D." <micelin01@...>
Date: Thu Apr 19, 2012 2:52 pm
Subject: Question re how to treat data, statistically.
nsmiceli
Send Email Send Email
 
Dear group members,

Please excuse the cross posting. When examining more than one season's
worth of data, do you think that there is a need to examine the data
using time-series methods?

If this is too far off topic for the general group discussion, please
feel free to respond to me directly.

Regards,

Nick Miceli

#4262 From: "David J Wheeler" <dj.wheeler@...>
Date: Tue Apr 24, 2012 4:22 am
Subject: Newbie Help
dj.wheeler
Send Email Send Email
 
Being new to databases, I am wondering if someone might help me with
instructions on how to query career stats from the csv data provided recently.

I have tried opening the db file itself, but my computer locks up (as if the
file is too large to process. Is there a trick to this I don't know of?

All help is greatly appreciated.

#4263 From: "Tangotiger" <tom@...>
Date: Tue Apr 24, 2012 1:36 pm
Subject: Re: Newbie Help
tom@...
Send Email Send Email
 
You should download the Lahman MS Access database:

http://www.seanlahman.com/baseball-archive/statistics/

It'll make life easier for you.

Tom

> Being new to databases, I am wondering if someone might help me with
> instructions on how to query career stats from the csv data provided
> recently.
>
> I have tried opening the db file itself, but my computer locks up (as if
> the file is too large to process. Is there a trick to this I don't know
> of?
>
> All help is greatly appreciated.
>
>
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#4264 From: Matthew Gargano <mgargano@...>
Date: Tue Apr 24, 2012 1:03 pm
Subject: Re: Newbie Help
tkestars
Send Email Send Email
 
The easy way: use microsoft access

If you are using windows, the cheaper-and-not-as-easy-but-probably-better-way is to install XAMPP (or LAMP if you have a spare computer or want to use a virtual machine) and use something like phpmyadmin which provides an easy queryable interface. I'd recommend learning PHP, IMO it makes it a helluva lot easier to process the data. With PHP you can also use the baseball tools library I created (shameless plug) https://github.com/matstars/baseball-tools

Net/net, if you don't want to spend a fair amount of time, access is your best bet.


On Tue, Apr 24, 2012 at 12:22 AM, David J Wheeler <dj.wheeler@...> wrote:
Being new to databases, I am wondering if someone might help me with instructions on how to query career stats from the csv data provided recently.

I have tried opening the db file itself, but my computer locks up (as if the file is too large to process. Is there a trick to this I don't know of?

All help is greatly appreciated.




------------------------------------

http://www.baseball-databank.org/Yahoo! Groups Links

<*> To visit your group on the web, go to:
http://groups.yahoo.com/group/baseball-databank/

<*> Your email settings:
Individual Email | Traditional

<*> To change settings online go to:
http://groups.yahoo.com/group/baseball-databank/join
(Yahoo! ID required)

<*> To change settings via email:
baseball-databank-digest@yahoogroups.com
baseball-databank-fullfeatured@yahoogroups.com

<*> To unsubscribe from this group, send an email to:
baseball-databank-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/



#4265 From: "FrankPereiro" <franpereiro@...>
Date: Thu May 3, 2012 7:52 pm
Subject: Working with different Databases
franpereiro
Send Email Send Email
 
Hi there,

I would like to merge (unite, join or however it's called) different databases.

We have a baseball site to highlight stats of latino players and we have found
some databases that we'd like to work with. We downloaded the mlb and npb
databases. And we're working on a venezuelan baseball, italian and dutch 
databases too.

But the problem are the id's of players, teams, managers, etc. The question is:
Is there a way to create one "main", batting, pitching, etc. table other than
doing it by hand?

I'm sorry if you don't understand the question, you sure can tell that english
is not my mother language.

Thanks in advanced for any tips.

Greetings,

#4266 From: "Clem Comly" <ccomly@...>
Date: Thu May 17, 2012 5:31 am
Subject: player IDs in Master but not in Appearances
ccomly2003
Send Email Send Email
 
The latest consistency check has 5 found when 2 expected.  Two of the 5 are cases apparently where the same player is accidentally in Master twice.  It looks to me that obrieje01 was merged into obriepe01 on the Retrosheet site.  Similarly whitecb01 was merged into whitebi02.  There are 1 or 2 Fielding
rows for each mergee and is 1 batting row.  I suspect smith02 who debuted 5/31/1886 was merged into another player but I am not sure who.
 
Clem Comly

#4267 From: KJOK <kjokbaseball@...>
Date: Fri May 18, 2012 5:00 am
Subject: Re: player IDs in Master but not in Appearances
kjokbaseball
Send Email Send Email
 
I think smith02 is now smithre01 - Rex Smith?
 
THANKS,
Kevin
From: Clem Comly <ccomly@...>
To: baseball-databank@yahoogroups.com
Sent: Thursday, May 17, 2012 12:31 AM
Subject: [baseball-databank] player IDs in Master but not in Appearances

 
The latest consistency check has 5 found when 2 expected.  Two of the 5 are cases apparently where the same player is accidentally in Master twice.  It looks to me that obrieje01 was merged into obriepe01 on the Retrosheet site.  Similarly whitecb01 was merged into whitebi02.  There are 1 or 2 Fielding
rows for each mergee and is 1 batting row.  I suspect smith02 who debuted 5/31/1886 was merged into another player but I am not sure who.
 
Clem Comly



#4268 From: "Clem Comly" <ccomly@...>
Date: Fri May 18, 2012 7:48 am
Subject: Re: player IDs in Batting but not in Appearances
ccomly2003
Send Email Send Email
 
I downloaded the ACCESS version of the DB early this week and hope it is in sync with other versions,
 
Alberto reported [query 09] There are 78 rows (season 2011) that are in the Batting table but not in the Appearances
.
 
I also found 78 rows for 2011 alone. 
The 78 rows appear to be players playing for 2 or more teams in 2011.  J.C. Romero has no PHI row for 2011 in Appearances but his COL row in Appearances has his combined totals for both teams.  Looking quickly, it appears most or all traded players are
missing an Appearances row. 
 
BTW a
ll of Youklis’ Appearances rows except 2011 have his old player ID.  The only 20th century problem is snydeal01 in Appearances s/b snydeja01.
There are 10 problem rows in 1880s.
 
Clem Comly

#4269 From: "Clem Comly" <ccomly@...>
Date: Fri May 18, 2012 6:02 am
Subject: Re: player IDs in Master but not in Appearances
ccomly2003
Send Email Send Email
 
Yes, Rex Smith.  Thanks, Kevin.  But all existing smithre01 records will need stint = 2 (Fielding,Pitching, Batting) before changing smith02 to smithre01.
 
These changes will reduce the count for query 5 from 5 to 2.  The 2 are Kiger (whose only ML experience was in post-season) and Hemond (winner Branch Rickey Award so in AwardsPlayers).
 
Clem Comly
 
From: KJOK
Sent: Friday, May 18, 2012 1:00 AM
Subject: Re: [baseball-databank] player IDs in Master but not in Appearances
 
 

I think smith02 is now smithre01 - Rex Smith?
 
THANKS,
Kevin

#4270 From: "Clem Comly" <ccomly@...>
Date: Sat May 19, 2012 9:40 pm
Subject: Can someone run consistency checks against 2012 version of DB?
ccomly2003
Send Email Send Email
 
I downloaded Access version and it doesn’t come with consistency queries and posted consistency queries each need manual updating to run in access.
 
For instance, I found an appearance row with a bad playerID.  For NYA for 2011 playerID “burneaj” s/b “burneaj01”.  I found no 2011 Fielding row for either player ID.
 
Clem Comly

#4271 From: "OrcomBoy" <gfhill@...>
Date: Mon May 28, 2012 12:22 am
Subject: Birth Dates differ between database and rosters
OrcomBoy
Send Email Send Email
 
Good day,
While I was merging the database with the current active rosters from mlb.com I
came across the following discrepancies in birth dates:

Name           Birth     ID        MLB BD
Ervin Santana     12/12/82  santaer01 1/10/83
Erik Bedard       3/5/79    bedarer01 3/6/79
Jose Arredondo    3/12/84   arredjo01 3/30/84
George Kottaras   5/10/83   kottage01 5/16/83
Freddy Garcia     10/6/76   garcifr02 6/10/75
Humberto Quintero 8/2/79    quinthu01 8/8/79
Matt Garza        11/26/83  garzama01 11/11/83
Aaron Crow        11/10/86  crowaa01  11/11/86
Gregor Blanco     12/24/83  blancgr01 12/12/83

Has anyone encountered this before? Is there any way to determine which is
correct? I was trying to calculate the average age by team, so I am using the
MLB date for now, but I am curious to see which is correct.

Thank you,
Greg Hill

#4272 From: KJOK <kjokbaseball@...>
Date: Mon May 28, 2012 6:14 pm
Subject: Re: Birth Dates differ between database and rosters
kjokbaseball
Send Email Send Email
 
Freddie Garcia's discrepancy is I think well known.  Apparently the date was originally mis-interpreted because 6/10 internationally is October 6th not June 10th, so I believe the October 10, 1976 is now considered the official date, although whether it's correct is another story.
 
THANKS,
Kevin
From: OrcomBoy <gfhill@...>
To: baseball-databank@yahoogroups.com
Sent: Sunday, May 27, 2012 7:22 PM
Subject: [baseball-databank] Birth Dates differ between database and rosters

 
Good day,
While I was merging the database with the current active rosters from mlb.com I came across the following discrepancies in birth dates:

Name Birth ID MLB BD
Ervin Santana 12/12/82 santaer01 1/10/83
Erik Bedard 3/5/79 bedarer01 3/6/79
Jose Arredondo 3/12/84 arredjo01 3/30/84
George Kottaras 5/10/83 kottage01 5/16/83
Freddy Garcia 10/6/76 garcifr02 6/10/75
Humberto Quintero 8/2/79 quinthu01 8/8/79
Matt Garza 11/26/83 garzama01 11/11/83
Aaron Crow 11/10/86 crowaa01 11/11/86
Gregor Blanco 12/24/83 blancgr01 12/12/83

Has anyone encountered this before? Is there any way to determine which is correct? I was trying to calculate the average age by team, so I am using the MLB date for now, but I am curious to see which is correct.

Thank you,
Greg Hill




#4273 From: "chrislambrou" <chrislambrou@...>
Date: Tue May 29, 2012 3:46 am
Subject: 2012 data?
chrislambrou
Send Email Send Email
 
Probably a dumb question but can anyone lead me to a good source for 2012 MLB
data?

I'd like to provide current stats and suspect there are many services that
provide this data (for a fee).  I'm curious if there are any other sources or
open source projects that I can parse 2012 data from?

Thanks -Chris.

#4274 From: "KJOK" <kjokbaseball@...>
Date: Tue May 29, 2012 6:46 pm
Subject: Re: Birth Dates differ between database and rosters
kjokbaseball
Send Email Send Email
 
From David Vincent via Rod Nelson, the SABR Biographical database (which
Baseball-Reference.com uses) has most of these resolved:

Player    SABR DB    MLB.com

Santana   12/12/1982 same

Bedard    03/05/1979 same

Arredondo 03/12/1984 same

Kottaras  05/16/1983 05/10/1983

Garcia    10/06/1976 same

Quintero  08/02/1979 same

Garza     11/26/1983 same

Crow      11/11/1986 11/10/1986

Blanco    12/24/1983 same


THANKS,
Kevin Johnson
www.seamheads.com


--- In baseball-databank@yahoogroups.com, KJOK <kjokbaseball@...> wrote:
>
> Freddie Garcia's discrepancy is I think well known.  Apparently the date was
originally mis-interpreted because 6/10 internationally is October 6th not June
10th, so I believe the October 10, 1976 is now considered the official date,
although whether it's correct is another story.
>
> THANKS,
> Kevin
>
>
> >________________________________
> > From: OrcomBoy <gfhill@...>
> >To: baseball-databank@yahoogroups.com
> >Sent: Sunday, May 27, 2012 7:22 PM
> >Subject: [baseball-databank] Birth Dates differ between database and rosters
> >
> >
> >
> > 
> >
> >Good day,
> >While I was merging the database with the current active rosters from mlb.com
I came across the following discrepancies in birth dates:
> >
> >Name           Birth     ID        MLB BD
> >Ervin Santana     12/12/82  santaer01 1/10/83
> >Erik Bedard       3/5/79    bedarer01 3/6/79
> >Jose Arredondo    3/12/84   arredjo01 3/30/84
> >George Kottaras   5/10/83   kottage01 5/16/83
> >Freddy Garcia     10/6/76   garcifr02 6/10/75
> >Humberto Quintero 8/2/79    quinthu01 8/8/79
> >Matt Garza        11/26/83  garzama01 11/11/83
> >Aaron Crow        11/10/86  crowaa01  11/11/86
> >Gregor Blanco     12/24/83  blancgr01 12/12/83
> >
> >Has anyone encountered this before? Is there any way to determine which is
correct? I was trying to calculate the average age by team, so I am using the
MLB date for now, but I am curious to see which is correct.
> >
> >Thank you,
> >Greg Hill
> >
> >
> >
> >
> >
>

#4275 From: Sean Forman <sean-forman@...>
Date: Tue May 29, 2012 7:03 pm
Subject: Re: Re: Birth Dates differ between database and rosters
sforman71
Send Email Send Email
 

sean
---
Sean Forman
Sports Reference LLC, President
http://www.sports-reference.com/



On Tue, May 29, 2012 at 2:46 PM, KJOK <kjokbaseball@...> wrote:

From David Vincent via Rod Nelson, the SABR Biographical database (which Baseball-Reference.com uses) has most of these resolved:

Player SABR DB MLB.com

Santana 12/12/1982 same

Bedard 03/05/1979 same

Arredondo 03/12/1984 same

Kottaras 05/16/1983 05/10/1983

Garcia 10/06/1976 same

Quintero 08/02/1979 same

Garza 11/26/1983 same

Crow 11/11/1986 11/10/1986

Blanco 12/24/1983 same

THANKS,
Kevin Johnson
www.seamheads.com

--- In baseball-databank@yahoogroups.com, KJOK <kjokbaseball@...> wrote:
>
> Freddie Garcia's discrepancy is I think well known.  Apparently the date was originally mis-interpreted because 6/10 internationally is October 6th not June 10th, so I believe the October 10, 1976 is now considered the official date, although whether it's correct is another story.
>
> THANKS,
> Kevin
>
>
> >________________________________
> > From: OrcomBoy <gfhill@...>


> >To: baseball-databank@yahoogroups.com
> >Sent: Sunday, May 27, 2012 7:22 PM
> >Subject: [baseball-databank] Birth Dates differ between database and rosters
> >
> >
> >
> > 
> >
> >Good day,
> >While I was merging the database with the current active rosters from mlb.com I came across the following discrepancies in birth dates:
> >
> >Name Birth ID MLB BD
> >Ervin Santana 12/12/82 santaer01 1/10/83
> >Erik Bedard 3/5/79 bedarer01 3/6/79
> >Jose Arredondo 3/12/84 arredjo01 3/30/84
> >George Kottaras 5/10/83 kottage01 5/16/83
> >Freddy Garcia 10/6/76 garcifr02 6/10/75
> >Humberto Quintero 8/2/79 quinthu01 8/8/79
> >Matt Garza 11/26/83 garzama01 11/11/83
> >Aaron Crow 11/10/86 crowaa01 11/11/86
> >Gregor Blanco 12/24/83 blancgr01 12/12/83
> >
> >Has anyone encountered this before? Is there any way to determine which is correct? I was trying to calculate the average age by team, so I am using the MLB date for now, but I am curious to see which is correct.
> >
> >Thank you,
> >Greg Hill
> >
> >
> >
> >
> >
>



#4276 From: "chrislambrou" <chrislambrou@...>
Date: Tue Jun 5, 2012 2:42 pm
Subject: Draft data
chrislambrou
Send Email Send Email
 
I don't see any draft info in the master table and no external files that seem
to contain this - is there a source for draft data elsewhere?

Is this one of those times I should contribute to the project and compile my own
to upload?

Thanks,
-Chris

#4277 From: "railsplitter_44" <danielghirsch@...>
Date: Tue Jun 5, 2012 9:45 pm
Subject: Re: Draft data
railsplitter_44
Send Email Send Email
 
The only free data that I know of is from the retrosheet transactions database. 
But this will only contain draft info from players who made it to the major
leagues.  But since most players don't make it, a lot of the draft picks will be
missing.

If you are willing to contribute the remaining data, I'm sure we all would be
appreciative.  But I can imagine that manually adding this data would take a lot
of time.

If you are looking for a starting point with the retrosheet data, send me an
email (danielghirsch@...) and I'll send you an excel sheet that has this
data with databank's player and teamID's

Dan Hirsch



--- In baseball-databank@yahoogroups.com, "chrislambrou" <chrislambrou@...>
wrote:
>
> I don't see any draft info in the master table and no external files that seem
to contain this - is there a source for draft data elsewhere?
>
> Is this one of those times I should contribute to the project and compile my
own to upload?
>
> Thanks,
> -Chris
>

#4278 From: KJOK <kjokbaseball@...>
Date: Tue Jun 5, 2012 10:13 pm
Subject: Re: Re: Draft data
kjokbaseball
Send Email Send Email
 
The data has been compiled, as you can see here:
 
 
However, to my knowledge that dataset has not been publicly released.
 
THANKS,
Kevin
From: railsplitter_44 <danielghirsch@...>
To: baseball-databank@yahoogroups.com
Sent: Tuesday, June 5, 2012 4:45 PM
Subject: [baseball-databank] Re: Draft data

 
The only free data that I know of is from the retrosheet transactions database. But this will only contain draft info from players who made it to the major leagues. But since most players don't make it, a lot of the draft picks will be missing.

If you are willing to contribute the remaining data, I'm sure we all would be appreciative. But I can imagine that manually adding this data would take a lot of time.

If you are looking for a starting point with the retrosheet data, send me an email (danielghirsch@...) and I'll send you an excel sheet that has this data with databank's player and teamID's

Dan Hirsch

--- In baseball-databank@yahoogroups.com, "chrislambrou" <chrislambrou@...> wrote:
>
> I don't see any draft info in the master table and no external files that seem to contain this - is there a source for draft data elsewhere?
>
> Is this one of those times I should contribute to the project and compile my own to upload?
>
> Thanks,
> -Chris
>




#4279 From: Clay Dreslough <cjd@...>
Date: Sun Apr 1, 2012 8:19 pm
Subject: Wild Card Showdown Stints
upa2112
Send Email Send Email
 
Speaking of stints, is there a consensus on the abbreviation to use for
the single-game wild card round when displaying player stats?

For example, the current abbreviations are ALDS, NLDS, ALCS, NLCS and WS.

Like this:

http://bit.ly/HJ5beA

I don't have the luxury of waiting until October. So, FWIW, we're using
'ALWC' and 'NLWC' to label stat lines for the Wild Card Showdown.

Clay

#4280 From: "chrislambrou" <chrislambrou@...>
Date: Sun May 13, 2012 4:24 am
Subject: Bug - ramon ortiz - appearences table
chrislambrou
Send Email Send Email
 
PlayerID for year 2011 is 'ortizra02' instead of 'ortizra01' in the appearances
table only.

Please let me know if I am missing something.  Thanks.
-Chris

Messages 4251 - 4280 of 4385   Oldest  |  < Older  |  Newer >  |  Newest
Add to My Yahoo!      XML What's This?

Copyright 2010 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines NEW - Help