Go Back  (BETA) DVD Talk Forum > Shopping Discussions > DVD Bargains > DVD Clubs
Reload this Page >

The new updated list of CH DVDs (data miner)

Community
Search
DVD Clubs Discuss & Strategize about DVD Clubs like Columbia House

The new updated list of CH DVDs (data miner)

Thread Tools
 
Search this Thread
 
Old 08-14-05, 07:14 AM
  #426  
DVD Talk Gold Edition
 
Join Date: Jan 2002
Posts: 2,926
Likes: 0
Received 1 Like on 1 Post
Does this display properly for anyone using Open Office?

The columns seem to display multiple data in each rather than each being exclusive to its own (ie. selection numbers mixed in with all sorts of other data like widescreen, etc).

Is this something I can fix with a setting change?
Old 09-04-05, 05:27 AM
  #427  
Member
 
Join Date: Mar 2002
Posts: 135
Likes: 0
Received 0 Likes on 0 Posts
The files are "delimited" by a character like a comma or semi-colon. Try to find a setting in your program to read it as delimited.
Old 09-04-05, 03:39 PM
  #428  
DVD Talk Gold Edition
 
Join Date: Jan 2002
Posts: 2,926
Likes: 0
Received 1 Like on 1 Post
OOo properly recognizes and opens it as delimited but the data is still being displayed as above. Is there anyone else using OOo that is experiencing this?

I've tried both semi-colon and comma delimted (and both together) but no change.

I don't know enough about how this works but there does appear to be duplicate commas and unneeded characters within the csv file (not sure if they are needed or not or could even be the cause but, without additional eyes, is the only thing I can think of).
Old 09-26-05, 11:46 PM
  #429  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by basaro
OK guys, I have something small and slightly usable up now at:
http://www.saladbarscam.com/ch
Note the www. and /ch are required, otherwise you won't get there.

It's a rough sample of what will ultimately be available. I need to do a lot more work on it. Now it's just a sortable list of all the titles (from jhester's first datamine) available only at 50 titles at a time (the 50 limit is temporary for this trial). I will update it with his second datamine soon, and work on the other data from nickelplated as well.
I welcome all comments and suggestions, positive or negative. I will announce the rest of my plans later. I'm in a rush this afternoon!!

See ya.

Oh yeah, I designed and tested this with Firefox! If something looks wacky in IE, switch browsers and let me know, OK ?
I updated my site with the last import from August. It still needs work, but it's coming along (the data is all accurate though). The next thing will be to fix a couple of the issues I mentioned before (which I haven't got around to yet), and to add a list of pre-order titles. I also have to work on incorporating the item number since it is now available in jhester's data, then I can create direct links to CH! I am also going to clean up the listing in the import update section, and obviously work on the ui. There are some accurate counts on the homepage now, but there are still a few more ways I want to organize the data (like regular price changes).

Is there a new import coming soon? It will only take me an hour or so to import a new datamine into my database now that things seem to be going smoothly for me. And soon, if jhester wants, he can upload the file to my server himself, and it will be taken care of on it's own.

I also jacked my sorted listing count up to 1000 at a time. It might take a little longer to load, but it is probably much more useful this way.

Cheers
Old 09-27-05, 11:28 AM
  #430  
Senior Member
 
Join Date: Jun 2003
Posts: 270
Likes: 0
Received 0 Likes on 0 Posts
Basaro, you da man!! I can't tell you how much the work that you and the others working to get CH titles obtained and compiled is appreciated! Keep up the awesome work!

Originally Posted by basaro
I updated my site with the last import from August. It still needs work, but it's coming along (the data is all accurate though). The next thing will be to fix a couple of the issues I mentioned before (which I haven't got around to yet), and to add a list of pre-order titles. I also have to work on incorporating the item number since it is now available in jhester's data, then I can create direct links to CH! I am also going to clean up the listing in the import update section, and obviously work on the ui. There are some accurate counts on the homepage now, but there are still a few more ways I want to organize the data (like regular price changes).

Is there a new import coming soon? It will only take me an hour or so to import a new datamine into my database now that things seem to be going smoothly for me. And soon, if jhester wants, he can upload the file to my server himself, and it will be taken care of on it's own.

I also jacked my sorted listing count up to 1000 at a time. It might take a little longer to load, but it is probably much more useful this way.

Cheers
Old 09-27-05, 02:03 PM
  #431  
Senior Member
 
Join Date: Mar 2004
Posts: 789
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by xpfshost
Me too!
Old 09-30-05, 08:54 AM
  #432  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Fixed a couple things last night and added the pre-orders. Search is coming next. I've been working on something, but it's soooo slow right now. Needs some tweaking.

We need a new import though! Jhester? I would be happy to run the script every week, and keep the db updated, if you cannot. I am still feeling spoiled from when bga used to do this weekly.
Old 10-11-05, 05:25 PM
  #433  
Member
 
Join Date: Mar 2002
Posts: 135
Likes: 0
Received 0 Likes on 0 Posts
No I'm not dead yet (Despite appearances from my absence!) I find myself in school and covered up with work. So I apologize for not keeping my data more current. I can't make any promises, but I will try to work on getting an updated version. However, my process is not ideal, and I really hope that some other brave soul will step up with a process with better integrity than mine.
Old 10-11-05, 06:31 PM
  #434  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Sweet! I am ready to import the changes into my database whenever you get the next datamine. Thanks, keep us updated.
And good luck in school
Old 12-04-05, 03:59 PM
  #435  
Member
 
Join Date: Jan 2003
Location: Simpsonville SC
Posts: 185
Likes: 0
Received 0 Likes on 0 Posts
I am messing around with a data miner for CH. I basically have it working but need to know the range of the Item numbers. I am pretty sure that all Item #s are between 1500000 and 1799999. Does anyone know if there is anything outside this range or if it can be tightened more.
Old 12-05-05, 07:12 AM
  #436  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
AlfB,

I saw your other thread and responded there, but I was going to ask you post additional comments here, but you beat me to it! So here is what I posted in the other thread, ignore that one, and please just followup here, thanks!



Cool. I haven't had time to write my own data miner. If you can generate this in a csv format similar to what jhester had done, I can import it into my db as well.

Based on the last import that I did from jhester, there were 10,527 total items: Starting with 1548333 "10" and ending with 1776444 "The Polar Express Gift Set". You might be able to tighten up your query a little more based on this, but I can't guarantee there aren't titles outside this range.

You're on the right track, thanks for your contribution!
Old 12-05-05, 09:10 AM
  #437  
Member
 
Join Date: Jan 2003
Location: Simpsonville SC
Posts: 185
Likes: 0
Received 0 Likes on 0 Posts
I looked at the file and decided to start with 1500000 and go through 1799999. I have already done the 1500000-1599999. I will be doing 16 tonight and 17 tommorrow night since it takes about 8-9 hours for each one. One gotcha for the file is that I am not capturing pricing as I am not currently a member. I am getting enrollment versus non enrollment which was my main goal here to begin with. Currently the data I am getting is Item #, Sel #, Enrollment/Not, Title, Rating, Num discs, Run Time, Studio, Rel Date and Format. When complete, I can provide in a comma delimited text file or an Excel Spreadsheet. What would be best way to get it to you?

Last edited by AlfB; 12-05-05 at 02:38 PM.
Old 12-05-05, 01:02 PM
  #438  
DVD Talk Legend
 
Join Date: Sep 2004
Location: Twin Cities, US of A
Posts: 14,163
Received 169 Likes on 134 Posts
Somebody get this man a membership, STAT!
Old 12-05-05, 07:53 PM
  #439  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by AlfB
I looked at the file and decided to start with 1500000 and go through 1799999. I have already done the 1500000-1599999. I will be doing 16 tonight and 17 tommorrow night since it takes about 8-9 hours for each one. One gotcha for the file is that I am not capturing pricing as I am not currently a member. I am getting enrollment versus non enrollment which was my main goal here to begin with. Currently the data I am getting is Item #, Sel #, Enrollment/Not, Title, Rating, Num discs, Run Time, Studio, Rel Date and Format. When complete, I can provide in a comma delimited text file or an Excel Spreadsheet. What would be best way to get it to you?
Please contact me through email via my profile here at dvdtalk. If this works out to be a good thing, I can give you access to my server in the future and let you add the new files at your leisure. Then everything will be updated automatically.

Thanks for your hard work! It will be nice to have enrollment status again.
Old 12-05-05, 09:09 PM
  #440  
Member
 
Join Date: Jan 2003
Location: Simpsonville SC
Posts: 185
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by basaro
Please contact me through email via my profile here at dvdtalk. If this works out to be a good thing, I can give you access to my server in the future and let you add the new files at your leisure. Then everything will be updated automatically.

Thanks for your hard work! It will be nice to have enrollment status again.
No problem. Will contact you when I have the complete data set. As I type this, I am at 1632082 on the second set. The last set should be complete Wed AM. Most likely it will finish too late to send that morning so you should have it early Wed evening. Do you want comma delimited text or an excel file?

By the way, are you anywhere near Bow? My sister lives there.

Last edited by AlfB; 12-05-05 at 09:12 PM.
Old 12-06-05, 06:59 AM
  #441  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by AlfB
No problem. Will contact you when I have the complete data set. As I type this, I am at 1632082 on the second set. The last set should be complete Wed AM. Most likely it will finish too late to send that morning so you should have it early Wed evening. Do you want comma delimited text or an excel file?

By the way, are you anywhere near Bow? My sister lives there.
Sweet. Forgot to mention my import handles csv files, so that would be great.

I am about 45min from Bow. I used to live in Concord there myself for a while. I'd love to move back to that area if I can ever find a good job that isn't in Mass, but for now, I'm stuck near the border.

See my post above for a link to the website where I host this stuff if you're interested. If your data works out well, I'll start putting in the other features and start making some changes so it will be better.

Last edited by basaro; 12-06-05 at 07:03 AM.
Old 12-06-05, 09:57 PM
  #442  
Member
 
Join Date: Jan 2003
Location: Simpsonville SC
Posts: 185
Likes: 0
Received 0 Likes on 0 Posts
OK guys, bad news. I found a bug in the code and had to fix it. Unfortunately it gave an erroneous result in some cases for the enrollment/not field. I've fixed it and started the run again. The good news is that I had enough info to cull the run down to two nights. The bad news is that it will be another day before we have data as I will run one tonight and the next tommorrow. basaro, I will send it as soon as I have it. Most likely Thursday evening. Sorry for the delay guys.
Old 12-11-05, 08:59 AM
  #443  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Just an update for everyone:

AlfB has done a great job so far on his datamine. We're working out the kinks and caveats, and something will be imported into my database soon.

For the meantime I'll be putting the newest datamine csv file up on my site so you all can browse it manually for now. I'll post a link to it a little later this afternoon. I'm off to get supplies for football right now.

Big to AlfB!
Old 12-15-05, 12:56 PM
  #444  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
I posted up AlfB's datamine file on my site. Lots of interesting data in it like future titles that aren't even available for pre-order yet! This could be a good reference for looking up Columbia Tri-Star and Universal titles which never show up until release date! I haven't verified that in the data yet, but it all seems possible.

Go to the news section I just added to my site. Again, nothing special, but it gets the job done for now.

Sorry for the delay on my end too. My server crashed on me the other day,really really need a new one now

http://www.saladbarscam.com/web/ch.nsf

Thanks AlfB!
Old 12-15-05, 02:37 PM
  #445  
DVD Talk Gold Edition
 
Join Date: Jan 2002
Posts: 2,926
Likes: 0
Received 1 Like on 1 Post
Looks pretty good. I'm not sure if this is just an Open Office issue but some of the titles need fine tuning as the seperation of fields are getting confused by some extra commas (I think).

Perhaps there is an easy, and automated, way of removing all commas from any fields so as not to confuse their seperations?

For examples (there are more though)..

They Shoot Movies Don't They? - the Making of Mirage
The Jeff Corwin Experience: Out on a Limb - Monkeys
Good Night and Good Luck
Old 12-15-05, 03:02 PM
  #446  
DVD Talk Platinum Edition
 
Join Date: Jul 2003
Location: New Hampshire
Posts: 3,096
Likes: 0
Received 0 Likes on 0 Posts
Originally Posted by abintra
Looks pretty good. I'm not sure if this is just an Open Office issue but some of the titles need fine tuning as the seperation of fields are getting confused by some extra commas (I think).

Perhaps there is an easy, and automated, way of removing all commas from any fields so as not to confuse their seperations?

For examples (there are more though)..

They Shoot Movies Don't They? - the Making of Mirage
The Jeff Corwin Experience: Out on a Limb - Monkeys
Good Night and Good Luck
It's not an open office issue, it's a problem with the format of the titles in the datamine. This is mentioned right above the download on my site, as one of the caveats of the new datamine. I have already made the request on how to fix this to AlfB, and I believe we are in agreement, and that should be fixed for the next run. There are a few more things we are working on too.
If there is something other than the already known caveats, please let us know. I will have to make sure the data is coming in correctly before I can add it to my database and start generating stats, etc.

From my site:
Here is a list of the caveats right now:

1. The Enrollment column is opposite as stated. If Enrollment is specified, then it is actually Member Only. If Member is specified, then it is actually an Enrollment.
2. Some titles which contain commas in them, get forced into the next column and it throws the rest of that row off.
3. Some titles show up which aren't even available for pre-order yet (future releases)! Likewise there is no Pre-Order indicator at this time either.
4. There is no header for the list - Here is the current format:
* Item, Selection, Enrollment, Title, Rating, Discs, Time, Studio, Year, Format


#1,2 & 4 should all be fixed for the next datamine. #3 we're not so sure about how to fix that yet.


Thanks for the input

Last edited by basaro; 12-15-05 at 03:06 PM.
Old 12-15-05, 03:06 PM
  #447  
DVD Talk Gold Edition
 
Join Date: Jan 2002
Posts: 2,926
Likes: 0
Received 1 Like on 1 Post
Originally Posted by basaro
This is mentioned right above the download on my site, as one of the caveats of the new datamine.
Brilliant.

Thanks.
Old 12-16-05, 10:45 AM
  #448  
DVD Talk Hall of Fame
 
lizard's Avatar
 
Join Date: May 2000
Location: the Western Slope, Colorado
Posts: 7,944
Received 2 Likes on 2 Posts
Thank you basaro and AlfB! I was wondering if Columbia House was going to carry Serenity and it is on your list:
4315701 Serenity (Widescreen), list price $29.98, no sale price yet.
4316105 Serenity (Fullscreen, I think)

Since I passed on the recent BOGOF codes to save my fulfillments for Serenity, I am pleased that they are going to have it, to put it mildly.
Old 12-16-05, 11:57 AM
  #449  
Senior Member
 
Join Date: Jun 2003
Posts: 270
Likes: 0
Received 0 Likes on 0 Posts
Just wanted to say that you guys are the BEST!! Thanks a MILLION and keep up the good work!!
Old 12-17-05, 09:50 AM
  #450  
Member
 
Join Date: Jan 2003
Location: Simpsonville SC
Posts: 185
Likes: 0
Received 0 Likes on 0 Posts
Just wanted to let everyone know the current status. I was in the middle of doing another download when the ice storm hit the Carolinas. I lost power and my internet connection. I have about half the new download done and now that I have an internet connection again, I hope to finish tonight.


Archive - Advertising - Cookie Policy - Privacy Statement - Terms of Service - Your Privacy Choices -

Copyright © 2024 MH Sub I, LLC dba Internet Brands. All rights reserved. Use of this site indicates your consent to the Terms of Use.