Sean Lahman wrote:
> > The hardest/longest part is to match the various IDs
> > to the BDB ID. That's where the bulk of the work
> > comes from. So, if your source is MLB (through
> > MLB.com) or STATS (through ESPN.com) or whatever other
> > sources supply CBS, Yahoo, etc, they key is getting
> > the player IDs for those sources to match against BDB
> > ID.
> >
> > As soon as we get that, then we are talking about 1
> > hour to get the data from internet to BDB.
>
>
> Before anyone undertakes this work, there are some legal issues that
> generally prevent us from incorporating player ID's from commercial sources.
> Also, Sean Forman and I have methods and sources in place for compiling the
> 2003 stats when the season ends. Simply mining the stats from one of those
> sites presents some of the same legal issues, but more importantly accuracy
> issues.
>
> Regards,
> Sean Lahman
I want to echo Sean here. There are mechanisms already in place and the
and 2004 stats should come out fairly quickly. It will be covered.
Also, I have incorporated SABR's collegiate data into the BDB, should be
releasing it sometime soon and I hope to set aside a weekend to handle
all of the outstanding error issues prior to the 2004 release.
--
Sincerely,
Sean Forman
Baseball Stats! http://www.Baseball-Reference.com/