baseball-databank · Baseball Databank

Messages 3654 - 3683 of 4385
#3654 From: "Tangotiger" <tom@...>
Date: Thu Nov 20, 2008 7:14 pm
Subject: Re: Re: Player ID for new players in 2008
tom@...
 
No need.  Like I said, I sent it out somewhere, probably not here. Plus, I
sent out so many notes in those few days, who knows exactly what I was
saying.  I should have been more economical with my posts.

In any case, I'm glad that you gave it a second review.

Tom

> Sorry, I just missed that note.
>
> sean
>
>
>
>
> On Thu, Nov 20, 2008 at 11:06 AM, Tangotiger <tom@...> wrote:
>
>>   Hmmm... I made the announcement on my blog with the updated file, I
>> think
>> I made the announcement at Retrolist, and I guess I overlooked making
>> the
>> announcement here. Two out of three and all that...
>>
>> Thanks to Sean for the alert.
>>
>> The correct IDs, as well as the most up-to-date DB shell script, will
>> always be found here:
>>
>> http://tangotiger.net/bdb/
>>
>> Tom
>>
>>
>>
>
>
>
> --
> Sean Forman
> President, Sports Reference LLC
> http://www.sports-reference.com/
>


---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#3655 From: "Clem Comly" <ccomly@...>
Date: Fri Nov 21, 2008 3:49 am
Subject: Release 2008
ccomly2003
 
Speaking of missing notes, I see that the unbalanced HB between 1955 AL pitchers and hitters that I corrected, and a few of my other notes, are not reflected in the release I downloaded from baseball-databank.org tonight.  Was there a problem with them?
 
Clem Comly

#3656 From: "Tangotiger" <tom@...>
Date: Fri Nov 21, 2008 3:32 pm
Subject: Re: Re: Player ID for new players in 2008
tom@...
 
I should highlight that in that folder, I have the primary positions file
for every player/season.

What I did *not* do was have the BDB shell script import that file
automatically.  I could, but I didn't.  The reason is that the shell
script should import only the data that directly corresponds to the
"official" tables in the BDB.  (Note: it is very easy to import it
manually in Access: just click NEW/Import Table.)

I could expand, for example, by including wOBA or LWTS, or creating a
"BattingNoStint" table to group the records to get rid of the stint field.
  Really, there's no end to what we can do in terms of making the DB more
friendly.
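
As a rough illustration of the "BattingNoStint" idea, the sketch below groups a batting table by player and season and sums the counting stats across stints. It uses Python's sqlite3 module rather than Access, and the table layout (a small subset of columns), the sample rows, and the player IDs are all made up for the example; they are assumptions, not part of the BDB release.

```python
import sqlite3

# Hypothetical subset of the Batting table; the real table has many more columns.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Batting (playerID TEXT, yearID INTEGER, stint INTEGER, "
    "teamID TEXT, G INTEGER, AB INTEGER, H INTEGER)"
)
conn.executemany(
    "INSERT INTO Batting VALUES (?, ?, ?, ?, ?, ?, ?)",
    [
        ("examp01", 2008, 1, "AAA", 60, 200, 50),   # made-up rows
        ("examp01", 2008, 2, "BBB", 40, 100, 30),
        ("examp02", 2008, 1, "AAA", 150, 550, 160),
    ],
)

# One row per player/season: counting stats are summed across stints,
# and the stint field is replaced by a simple count of stints.
conn.execute(
    "CREATE TABLE BattingNoStint AS "
    "SELECT playerID, yearID, COUNT(*) AS stints, "
    "SUM(G) AS G, SUM(AB) AS AB, SUM(H) AS H "
    "FROM Batting GROUP BY playerID, yearID"
)

rows = conn.execute(
    "SELECT playerID, yearID, stints, G, AB, H FROM BattingNoStint "
    "ORDER BY playerID, yearID"
).fetchall()
for row in rows:
    print(row)
```

Rate stats would need to be recomputed from the summed totals rather than averaged across stints.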

Perhaps I will make an exception for this particular case, simply because
it's a fairly involved process to try to get the primary position.  If
there are other useful things that can be generated (that would require a
fairly involved process), please post it, and I'll consider it.

Thanks, Tom


> The correct IDs, as well as the most up-to-date DB shell script, will
> always be found here:
>
> http://tangotiger.net/bdb/
>
> Tom
>
>
>

#3657 From: "Sean Forman" <sean-forman@...>
Date: Fri Nov 21, 2008 6:12 pm
Subject: Re: Release 2008
sforman71
 


On Thu, Nov 20, 2008 at 10:49 PM, Clem Comly <ccomly@...> wrote:

Speaking of missing notes, I see that the unbalanced HB between 1955 AL pitchers and hitters that I corrected, and a few of my other notes, are not reflected in the release I downloaded from baseball-databank.org tonight.  Was there a problem with them?
 
Clem Comly









Clem,

It was purely a matter of trying to get the 2008 data out.  I definitely want to get your data in there, but just didn't have the time to.

sean


--
Sean Forman
President, Sports Reference LLC
http://www.sports-reference.com/

#3658 From: "Tangotiger" <tom@...>
Date: Fri Nov 21, 2008 6:31 pm
Subject: Re: Release 2008
tom@...
 
> Clem,
>
> It was purely a matter of trying to get the 2008 data out.  I definitely
> want to get your data in there, but just didn't have the time to.
>
> sean
>

What might be helpful is if someone creates a "2008" folder in the groups
section, and then all files that have not yet been incorporated can be
posted there.  This way, whoever has the time can provide a SQL script or
set of commands to do whatever updates or creates are needed, much the way,
for example, the new RETROids were incorporated.

Tom

#3659 From: "Sean Forman" <sean-forman@...>
Date: Fri Nov 21, 2008 8:07 pm
Subject: Updated the database with the latest for awards
sforman71
 
Not much else to say.

--
Sean Forman
President, Sports Reference LLC
http://www.sports-reference.com/

#3660 From: "wyerscj" <PontifexExMachina@...>
Date: Sat Nov 22, 2008 4:44 pm
Subject: Mistake in player's weight
wyerscj
 
For goldswa02 (Walt Goldsby), his listed weight is 1658.
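
A simple range check would catch this kind of entry. The sketch below is only an illustration: the function name, the 100-400 pound bounds, and the sample rows other than goldswa02 are assumptions chosen for the example, not anything defined by the database.

```python
def implausible_weights(players, low=100, high=400):
    """Return (playerID, weight) pairs whose weight falls outside [low, high] pounds.

    Missing weights (None) are skipped rather than flagged.
    """
    return [
        (player_id, weight)
        for player_id, weight in players
        if weight is not None and not (low <= weight <= high)
    ]


# goldswa02 is the reported record; the other rows are made up.
sample = [("goldswa02", 1658), ("examp01", 185), ("examp02", None)]
flagged = implausible_weights(sample)
print(flagged)
```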

--CW

#3661 From: "Dave Carter" <terpsfan101@...>
Date: Sat Nov 29, 2008 7:51 am
Subject: Wrong Retrosheet teamID for the Angels in 2007, 2008
terpsfan101
 
Mistake Location: Teams Table

Mistake Field: teamIDretro

Mistake Description: The Anaheim Angels retrosheet ID is ANA from 1997
to 2008. In the Teams Table, ALA is listed for 2007 and 2008.

#3662 From: "Clem Comly" <ccomly@...>
Date: Wed Dec 10, 2008 2:53 pm
Subject: Re:2008 DH games stat
ccomly2003
 
Pete Palmer has assured me that the BIS data is correct.  But obviously that is not the source of data for mlb.com, baseball-reference, and baseball-databank.  He reports the sum of DH G for all AL teams is 2454 and NL teams is 132 which is far more than those 3 on-line sources seem to indicate.
 
Clem Comly


#3663 From: "tangotiger" <tangotiger@...>
Date: Thu Dec 11, 2008 12:48 am
Subject: Re:2008 DH games stat
tangotiger
 
--- In baseball-databank@yahoogroups.com, "Clem Comly" <ccomly@...> wrote:
>
> Pete Palmer has assured me that the BIS data is correct.  But
> obviously that is not the source of data for mlb.com,
> baseball-reference, and baseball-databank.  He reports the sum of DH G
> for all AL teams is 2454 and NL teams is 132 which is far more than
> those 3 on-line sources seem to indicate.
>
> Clem Comly
>

I think I missed a post in here.  What is this in reply to?

Anyway, I get a total of 2142 DH games for AL and 132 for NL.

Sanity check: there are 9 (I think) games each AL team doesn't play in
an AL park.  So, 153*14= 2142

Maybe I only have the "starts" for DH, and not the PH/DH games?  I'll
check.

Tom

#3664 From: Tangotiger <tangotiger@...>
Date: Thu Dec 11, 2008 1:02 am
Subject: Re: Re:2008 DH games stat
tangotiger
 
The totals I reported were both from a separate source and the BDB.

When I check Retrosheet, I get a total of 2268 DH games, which compares to the 2142+132 = 2274.
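
The arithmetic in this exchange can be checked in a few lines. The 14 AL teams and the figures 9, 132, 2268, and 2454 come from the messages; the 162-game schedule is an assumption added here, and the variable names are only illustrative.

```python
AL_TEAMS = 14
SCHEDULE = 162            # assumed regular-season length
AWAY_INTERLEAGUE = 9      # the "(I think)" figure from the sanity check

# Games per AL team played with a DH, times the number of AL teams.
al_dh_games = (SCHEDULE - AWAY_INTERLEAGUE) * AL_TEAMS
nl_dh_games = 132
combined = al_dh_games + nl_dh_games

retrosheet_total = 2268   # total reported from Retrosheet
palmer_al = 2454          # AL total attributed to Pete Palmer / BIS

print(al_dh_games)                  # AL total from the sanity check
print(combined)                     # AL + NL
print(combined - retrosheet_total)  # gap versus Retrosheet
print(palmer_al - al_dh_games)      # gap versus the BIS figure for the AL
```

On these numbers the combined total differs from the Retrosheet count by 6 games, while the BIS figure for the AL is 312 higher than the starts-only estimate.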

Anyway, I'll stop now, since I missed the first part of this discussion.

Tom













#3665 From: "Rod Nelson" <rodericnelson@...>
Date: Sun Dec 14, 2008 10:10 pm
Subject: Happy Birthday, Sean~
rockymtnsabr
 
#3666 From: Brian Borawski <brianbor@...>
Date: Sun Dec 14, 2008 10:35 pm
Subject: Re: Happy Birthday, Sean~
brianbor
 
It doesn't say his age.  Or is that top secret?

--- On Sun, 12/14/08, Rod Nelson <rodericnelson@...> wrote:
From: Rod Nelson <rodericnelson@...>
Subject: [baseball-databank] Happy Birthday, Sean~
To: UnionProject@yahoogroups.com, baseball-databank@yahoogroups.com
Date: Sunday, December 14, 2008, 5:10 PM



#3667 From: "studes" <studes@...>
Date: Wed Dec 17, 2008 8:56 pm
Subject: Lahman database
studes
 
I was wondering when the Lahman database is going to be ready for
public consumption.  The website still says end of November, which
was, you know, a while ago.

Thanks,
dave

#3668 From: Ed R <reallyrottens_98@...>
Date: Wed Dec 17, 2008 9:04 pm
Subject: Re: Lahman database
reallyrotten...
 
I think it is already available here:
http://www.baseball-databank.org/

I put the sql version onto Mysql without problems and found Evan Longoria.




#3669 From: "Tangotiger" <tom@...>
Date: Wed Dec 17, 2008 9:04 pm
Subject: Re: Lahman database
tom@...
 
> I was wondering when the Lahman database is going to be ready for
> public consumption.  The website still says end of November, which
> was, you know, a while ago.
>
> Thanks,
> dave
>
>

I suggest you download the Access database here:
http://tangotiger.net/bdb/

Follow the instructions, and in a few minutes, you'll have a working copy
that is equivalent to the Lahman database.

Tom

---------------------------------------------
The Book--Playing The Percentages In Baseball
http://www.InsideTheBook.com

#3670 From: Dave <studes@...>
Date: Thu Dec 18, 2008 1:59 am
Subject: Re: Lahman database
studes
 
Thanks for the replies, but I like the simplicity of the Lahman output -- no
macros or interpreting SQL files or anything.

dave

#3671 From: "ctomarkin" <CTOMARKIN@...>
Date: Mon Dec 15, 2008 7:11 pm
Subject: DL data
ctomarkin
 
Has anyone compiled number of days on DL data? Number of games missed
due to being on DL?

That data would be very interesting to consider when forecasting
player stats.

Craig Tomarkin

#3672 From: KJOK <kjokbaseball@...>
Date: Fri Dec 19, 2008 3:43 pm
Subject: Re: DL data
kjokbaseball
 
Craig:
 
This doesn't directly answer your question, but there has been some discussion about adding DL and other types of 'off' time to the transactions database over on that egroup:
 
http://sports.groups.yahoo.com/group/BBTransactions/?yguid=84616618
 
if you want to search the archives there.
 
THANKS,
KJOK





#3673 From: "Matthew Gargano" <mgargano@...>
Date: Fri Dec 19, 2008 3:49 pm
Subject: Re: DL data
tkestars
 
Has anyone compiled minor league stats into a SQL/MDB database?


#3674 From: "wydiyd" <wydiyd@...>
Date: Fri Dec 19, 2008 4:03 pm
Subject: Re: DL data
wydiyd
 
I looked for the data and the only place I could find any data is at:

http://www.baseball-injury-report.com/about.shtml

It is a pay site, but all that I could currently find.

--- In baseball-databank@yahoogroups.com, KJOK <kjokbaseball@...> wrote:
>
> Craig:
>  
> This doesn't directly answer your question, but there has been some
> discussion about adding DL and other types of 'off' time to the
> transactions database over on that egroup:
>  
> http://sports.groups.yahoo.com/group/BBTransactions/?yguid=84616618
>  
> if you want to search the archives there.
>  
> THANKS,
> KJOK
>
>
> --- On Mon, 12/15/08, ctomarkin <CTOMARKIN@...> wrote:
>
> From: ctomarkin <CTOMARKIN@...>
> Subject: [baseball-databank] DL data
> To: baseball-databank@yahoogroups.com
> Date: Monday, December 15, 2008, 1:11 PM
>
>
>
>
>
>
> Has anyone complied number of days on DL data? Number of games missed
> due to being on DL?
>
> That data would be very interesting to consider when forecasting
> player stats.
>
> Craig Tomarkin
>

#3675 From: "Theodore Turocy" <drarbiter@...>
Date: Fri Dec 19, 2008 5:46 pm
Subject: Re: DL data
arb1ter
 
On Fri, Dec 19, 2008 at 9:49 AM, Matthew Gargano <mgargano@...> wrote:
> Has anyone complied minor league stats into a SQL/MDB database?

SABR is doing just that:

http://minors.sabrwebs.com

I will be releasing my annual CSV files of the previous year's minor
league statistics sometime in the next month, including all North
American leagues, plus Japan, Korea, Italy, and the Netherlands.

Ted

#3676 From: KJOK <kjokbaseball@...>
Date: Fri Dec 19, 2008 7:37 pm
Subject: Re: Re: DL data
kjokbaseball
 
Maybe I wasn't clear.
 
The data currently, as far as I know, is not PUBLICLY available.  There have been some private compilations that I know of.
 
However, there has been some discussion on the transactions egroup about compiling such data.  To do that, volunteers are of course needed to research and provide those transactions so they can be put into a publicly available database.
 
THANKS,
KJOK




#3677 From: "studes" <studes@...>
Date: Sat Dec 20, 2008 10:24 pm
Subject: Kaaihue
studes
 
I went ahead and downloaded Tango's 2008 files (thank you, Tango) and
noticed something.  Kila Kaaihue (or Ka'aihue) had 24 plate
appearances with the Royals this year.  His record is in the batting
table, but he's not listed in the Master table.

I don't know who tracks these things, so I am just posting it to the list.

dave

#3678 From: "studes" <studes@...>
Date: Sat Dec 20, 2008 10:26 pm
Subject: never mind
studes
 
I found him (Ka'aihue).  Stupid apostrophe!
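
One way to sidestep the apostrophe problem is to compare names on a normalized key. The sketch below is an assumption about how such a lookup might be written, not how the Master table is actually searched; `name_key` and the sample name list are hypothetical.

```python
import unicodedata


def name_key(name):
    """Casefolded, letters-only key, so "Ka'aihue" and "Kaaihue" compare equal."""
    # NFKD splits accented letters into a base letter plus combining marks;
    # the marks, apostrophes, spaces, and hyphens are then dropped by isalpha().
    decomposed = unicodedata.normalize("NFKD", name)
    return "".join(ch for ch in decomposed if ch.isalpha()).casefold()


# Made-up stand-in for last names as they appear in a Master-style table.
master_last_names = ["Ka'aihue", "Longoria"]
index = {name_key(n): n for n in master_last_names}

print(index.get(name_key("Kaaihue")))
```

Distinct players can share a key, so a match found this way is a candidate to confirm rather than a definitive identification.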

dave

#3679 From: Michael Greene <mfgreene79@...>
Date: Sat Dec 20, 2008 3:48 am
Subject: Re: Re: DL data
mfgreene79
 
The baseball transactions are all listed on the MLB website (going back to 2001).  I have done some work creating a web spider and trying to match the names back up to the player lists.  If this information is of interest, I can try and create an extract to get it up.  I don't use MYSQL (I'm a fan of Postgres), but I could probably set it up with inserts instead.

Mike  


  



#3680 From: "Matthew Gargano" <mgargano@...>
Date: Wed Dec 17, 2008 9:28 pm
Subject: Re: Lahman database
tkestars
 
What about Eva Longoria?  I'd be more interested in her.

On Wed, Dec 17, 2008 at 4:04 PM, Ed R <reallyrottens_98@...> wrote:
I think it is already available here:
http://www.baseball-databank.org/

I put the sql version onto Mysql without problems and found Evan Longoria.





#3681 From: "mfgreene79" <mfgreene79@...>
Date: Tue Dec 23, 2008 3:38 am
Subject: Re: DL data
mfgreene79
 
All,

Just uploaded the MLB transaction data, parsed out from the MLB
website.  The data includes all player transactions from 2001 up
through the end of October 2008.  Transactions are injuries, trades,
moves to the minors and back etc.

The data is in free form text, but there are some patterns.  I've
coded some rules to parse the information out into some basic
categories (to the DL, to the minors, trades, etc).  One other issue
is that the transactions are given by player name, no identifying
number is given.  I've tried to add the MLBID (sometimes called the
mlbam), which is used as a key to player sites on the MLB site.  The
table is rife with errors and mistakes, because some of the data is really
messy (especially in 2001-2002).

The layout of the table is below.  The file is actually a dump from a
Postgres database, but it should be basically compatible with MySQL
and other databases.  The file has the CREATE TABLE statements and
then loads the data with INSERT statements.  There are 21,677 records
in the data.

I'm all for free data, enjoy!


                  Table "mlb.mlb_trans"
    Column    |            Type             | Modifiers
-------------+-----------------------------+-----------
  transid     | integer                     |
  playername  | text                        |
  playerpos   | text                        |
  playerid    | integer                     |
  teamabbr    | text                        |
  trans_date  | date                        |
  trans_txt   | text                        |
  to_dl_ind   | smallint                    |
  from_dl_ind | smallint                    |
  dl_txt      | text                        |
  minors_ind  | smallint                    |
  minors_txt  | text                        |
  injury      | smallint                    |
  injury_txt  | text                        |
  callup_ind  | smallint                    |
  callup_txt  | text                        |
  sign_ind    | smallint                    |
  sign_txt    | text                        |
  insertdate  | timestamp without time zone |
  lost_ind    | smallint                    |
  lost_txt    | text                        |
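
As a hedged sketch of how this layout might answer the original question about days on the DL, the function below pairs each to-DL row with the player's next from-DL row and sums the gaps. It assumes the indicator columns use 1 for "true", which the post does not state, and the sample rows and player IDs are made up; only the column names come from the table above.

```python
from collections import defaultdict
from datetime import date


def days_on_dl(transactions):
    """Sum days between each to-DL row and the next from-DL row, per playerid.

    `transactions` is an iterable of
    (playerid, trans_date, to_dl_ind, from_dl_ind) tuples.
    Unmatched placements or activations are ignored rather than guessed at.
    """
    by_player = defaultdict(list)
    for playerid, trans_date, to_dl, from_dl in transactions:
        by_player[playerid].append((trans_date, to_dl, from_dl))

    totals = {}
    for playerid, rows in by_player.items():
        rows.sort(key=lambda r: r[0])      # chronological order
        placed_on = None
        total = 0
        for trans_date, to_dl, from_dl in rows:
            if to_dl == 1 and placed_on is None:
                placed_on = trans_date      # start of a DL stay
            elif from_dl == 1 and placed_on is not None:
                total += (trans_date - placed_on).days
                placed_on = None            # stay closed
        totals[playerid] = total
    return totals


# Made-up rows in (playerid, trans_date, to_dl_ind, from_dl_ind) form.
sample = [
    (1001, date(2008, 4, 10), 1, 0),
    (1001, date(2008, 4, 25), 0, 1),
    (1001, date(2008, 7, 1), 1, 0),
    (1001, date(2008, 7, 16), 0, 1),
    (1002, date(2008, 5, 1), 1, 0),   # never activated in the sample
]
dl_totals = days_on_dl(sample)
print(dl_totals)
```

Calendar days on the DL are not the same as games missed; converting one to the other would need the team schedules, which this table does not carry.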


Thanks,

Mike





#3682 From: CTOMARKIN@...
Date: Tue Dec 23, 2008 3:04 am
Subject: Re: Digest Number 1024
ctomarkin
 
Guys, thanks for looking into DL data. If I had sample data to play with, I could tell whether it is valuable for predicting next year's stats, as I'm sure others would as well. If it doesn't add anything, then I probably won't be interested in it next year. But if it does, then I'll need it every year. So if it's hard to compile, I might hope it doesn't add anything. :)

Tango - thank you for your beautifully simple Marcels. I've been using them as a baseline to improve on, as you suggested people do. I run my tests using SAS and am willing to share my code if anyone wants it. I can spin the forecasts out for any year as a CSV file; it takes about 5 seconds to run for any year. My Marcel batting code produces a file that is off only very slightly. However, my pitching calculations for Marcel are off significantly, so there is something I've lost in the translation. Again, I'm happy to share my code if anyone wants it or would like to edit it, and I'm happy to share my Marcels output for any year. Maybe someone will be able to detect where I'm off on the pitching (although even though they are off, they are close and seemingly quite good).

Aside from missing DL data, I also do not have minor league data. Does anyone have minor league data in an Access format like Lahman's? That would be a very meaningful extension to a player's complete picture. What source are other people using? Ideally, the IDs would match to playerID or retroID for players with MLB experience.

Chone - if you are on this list, I love your work. Based on my assessment of predictive quality, your 2008 forecasts did extremely well: better than PECOTA by enough to turn my head, since I consider PECOTA the standard. I use the sum of the squares of the differences between the forecasts and actuals for key stats. Some systems are stronger in some areas than others, so I credit each with its distinctiveness. Mine is really good at getting playing time right, thus the interest in fine-tuning it with DL data.
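
The comparison described above (sum of squared differences between forecasts and actuals) might look like the sketch below. The data layout, the function name, and every number in it are made up for illustration; it is not the SAS code mentioned in this message.

```python
def sum_squared_error(forecast, actual, stat):
    """Sum of squared forecast-minus-actual differences for one stat.

    Both arguments map a player ID to a dict of stats; only players
    present in both are compared.
    """
    common = forecast.keys() & actual.keys()
    return sum((forecast[p][stat] - actual[p][stat]) ** 2 for p in common)


# Hypothetical forecasts from two systems and the actual results.
forecast_a = {"examp01": {"HR": 20}, "examp02": {"HR": 10}}
forecast_b = {"examp01": {"HR": 25}, "examp02": {"HR": 12}}
actual = {"examp01": {"HR": 23}, "examp02": {"HR": 9}}

sse_a = sum_squared_error(forecast_a, actual, "HR")
sse_b = sum_squared_error(forecast_b, actual, "HR")
print(sse_a, sse_b)   # the lower total is the closer system on this stat
```

Running the same function once per key stat gives a per-stat comparison, which fits the point that different systems are stronger in different areas.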

Craig Tomarkin
baseballguru.com




#3683 From: Michael Greene <mfgreene79@...>
Date: Tue Dec 23, 2008 3:01 pm
Subject: Re: Re: DL data
mfgreene79
 
I probably should have mentioned the name of the file.  It is:

mlb_trans_w_schema.sql.gz




