> The hardest/longest part is to match the various IDs
> to the BDB ID. That's where the bulk of the work
> comes from. So, if your source is MLB (through
> MLB.com) or STATS (through ESPN.com) or whatever other
> sources supply CBS, Yahoo, etc., the key is getting
> the player IDs for those sources to match against BDB
> ID.
>
> As soon as we get that, then we are talking about 1
> hour to get the data from internet to BDB.

Before anyone undertakes this work, note that there are some legal issues that
generally prevent us from incorporating player IDs from commercial sources.

Also, Sean Forman and I have methods and sources in place for compiling the
2003 stats when the season ends. Simply mining the stats from one of those
sites presents some of the same legal issues and, more importantly, raises
accuracy issues.

Regards,
Sean Lahman