(Cross-posting to Retrolist and BDB) This is to point out a discrepancy, without knowing which is correct. The Lahman DB and B-R.com have Jim Busby with one...
Thank you for showing that the best answer is found in the easiest of places. Tom ... From: J. G. Preston <jgpreston@...> Subject: Re: [RetroList] Jiim...
I have compiled additional batting and pitching statistics from Retrosheet play-by-play data and wanted to share them here. I've linked the Retro IDs to the...
Dave - Thanks so much for your efforts from the baseball databank group! I was wondering if you could show me a thing or two. Also I was wondering if you knew...
Matt, I don't know anything about scraping data. The basic data (AB, H, ...) is already in the BDB database. Unless you wanted to derive the basic data for 2008....
... For pre-1980 years, the differences between these will blur in the data, and you won't have hits. I'd just go for: Air Outs - Pulled Ground Outs - Pulled ...
Charles, Most of your suggestions shouldn't be difficult to implement. I had a feeling that I should have used outs for batted balls. Although separating "Outs...
I guess that I could throw out some of the baserunning categories. Here's my new list: Batting: XI ROE ROE SH ROE SF RFC RFC SH PkO CS PkO PkO Error GB FB LD ...
Hey guys...I don't know if anyone else noticed this since it's not a huge deal 99% of the time, but for some reason in the 2007 season (most recent in the...
We've long discussed the idea of an APPEARANCES table, so that it captures exactly what it's supposed to capture, rather than taking over the BATTING table...
Tangotiger
tom@...
Oct 7, 2008 8:28 pm
3607
That's the way I would approach it as well if I were the guy doing the building of the DB...I highly recommend Lahman/Forman work toward that goal. It sounds...
... There are no current plans, that's correct. How/when this will be rectified is undetermined. Tom ... The Book--Playing The Percentages In Baseball ...
Tangotiger
tom@...
Oct 8, 2008 1:59 pm
3609
OK...thanks Tom. I'm going to ask our web coder to MacGyver an appearances table...that's the correct way to normalize the database anyway....
... some of the ... MySQL ... fields are ... range of ... int(4), ... as yearID ... Similarly, ... of ... season (or ... of 0 - ... playing ... been able ... ...
... If the nature of the data is that it is unsigned, it should be stored in an unsigned column. If the nature of your application is that you use the values...
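To make the signed-versus-unsigned point concrete, here is a minimal sketch of the value ranges involved. It assumes the standard 8-bit TINYINT and 16-bit SMALLINT widths; the helper names and the sample values (a 162-game count, the year 2008) are illustrative choices, not taken from the BDB schema.

```python
def int_range(bits, signed):
    """Return the (low, high) range of an integer column with the given width."""
    if signed:
        return (-(2 ** (bits - 1)), 2 ** (bits - 1) - 1)
    return (0, 2 ** bits - 1)


def fits(value, bits, signed):
    """True if value can be stored in a column of that width and signedness."""
    low, high = int_range(bits, signed)
    return low <= value <= high


# Illustrative values only: a games-played count and a season year.
games = 162
year = 2008

print(int_range(8, signed=True))     # signed 8-bit range
print(int_range(8, signed=False))    # unsigned 8-bit range
print(fits(games, 8, signed=True))   # 162 exceeds the signed 8-bit maximum
print(fits(games, 8, signed=False))  # 162 fits once the column is unsigned
print(fits(year, 16, signed=False))  # a four-digit year fits in 16 bits
```

A count that can never be negative gains its full positive range when the column is declared unsigned, which is the practical side of the argument above.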
Regarding the tinyint and int and the like issue: I disagree. I'll give you two good reasons: 1. An 8 GB flash drive costs $29. A 500 GB backup drive costs...
Tangotiger
tom@...
Oct 22, 2008 1:20 am
3614
1952 Bud Black for Detroit-- his season IP total should be 7.2 not 8. Daily on microfilm has 7.2 for him and all the other Detroit Ps IPs are OK (seems to me...
I've made good progress on the BDB update for this year and should have something soon. I'm adding an Appearances table with the following schema (this will...
I think we've discussed having an appearance or roster table before, and talked about it as a very good idea.  I'm less clear about the need for the Batting...
... I see this as backwards compatibility. I know that lots of people use the tables as is and historically G has been all games played. This will leave ...
It is usually desirable to go vertically rather than horizontally, like so: http://sports.groups.yahoo.com/group/baseball-databank/message/3606 You can create more...
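As a rough sketch of what "vertical" could mean here: one row per player, season, and position, instead of one column per position. The table layout, column names, and sample rows below are hypothetical and are not the schema proposed in the thread; SQLite stands in for MySQL only to keep the example self-contained.

```python
import sqlite3


def build_vertical_appearances(rows):
    """Create an in-memory table with one row per (player, year, position)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE appearances ("
        " playerID TEXT, yearID INTEGER, pos TEXT, G INTEGER,"
        " PRIMARY KEY (playerID, yearID, pos))"
    )
    conn.executemany("INSERT INTO appearances VALUES (?, ?, ?, ?)", rows)
    return conn


def games_by_position(conn, player_id, year_id):
    """Return {position: games} for one player-season."""
    cur = conn.execute(
        "SELECT pos, G FROM appearances"
        " WHERE playerID = ? AND yearID = ? ORDER BY pos",
        (player_id, year_id),
    )
    return dict(cur.fetchall())


# Hypothetical sample data, not real BDB records.
sample_rows = [
    ("player01", 2007, "1B", 20),
    ("player01", 2007, "LF", 95),
    ("player01", 2007, "DH", 10),
]

conn = build_vertical_appearances(sample_rows)
print(games_by_position(conn, "player01", 2007))

# A new category is just another row; no column has to be added to the table.
conn.execute("INSERT INTO appearances VALUES (?, ?, ?, ?)",
             ("player01", 2007, "PH", 5))
print(games_by_position(conn, "player01", 2007))
```

The trade-off is that a horizontal view (one row per player-season) then has to be produced by a pivot query rather than read directly from the table.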
Sean: That schema sounds great to me. Just a quick question - I did some research and searched the Yahoo! group and couldn't find the answer to this. There...
Who is "SABRE", and can you cite their reasoning? Tom ... The Book--Playing The Percentages In Baseball http://www.InsideTheBook.com...
Tangotiger
tom@...
Nov 11, 2008 5:06 pm
3622
I've uploaded a BDB update today. Here are the release notes. One should consider this a proposed release as I would like lots of review to point out any...
Two hours without comment? I hope that's because we are in shock and not passe over this. Anyway, a huge thanks to Sean for delivering the data. I am going...