SourceForge.net
2010-10-06 14:47:47 UTC
Feature Requests item #2825794, was opened at 2009-07-23 07:39
Message generated for change (Comment added) made by dekarl
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=424138&aid=2825794&group_id=39046
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: None
Group: None
Status: Closed
Priority: 5
Private: No
Submitted By: Mario Medina Nussbaum (medisoft)
Assigned to: Nobody/Anonymous (nobody)
Summary: tv listing source for cablevision digital Mexico City
Initial Comment:
Hi! I want to tell you that it is possible to fetch listings for Mexico from www.tvguide.com. It seems to use AJAX, but I think it is possible to fetch the results.
----------------------------------------------------------------------
Comment By: Karl Dietz (dekarl)
Date: 2010-10-06 16:47
Message:
I'll close this for now as it's not going anywhere without someone doing
the actual work :)
If you'd like to provide a legal data service for xmltv you can take a
look at the nonametv framework that's being used by some of the data
services. But it's quite a bit of work to contact every station to get
access to their data and the rights for redistribution.
If you find a friendly datasource that grants access for personal use but
not for redistribution you can take a peek at HowtoWriteAGrabber or
tv_grab_pt_meo for an example of what the grabber would look like.
----------------------------------------------------------------------
Comment By: Mario Medina Nussbaum (medisoft)
Date: 2009-07-25 16:31
Message:
I read the terms again, and they say nothing about distribution or
redistribution :) I know almost nothing about Perl. But maybe this
is a good time to begin to learn.
----------------------------------------------------------------------
Comment By: Robert Eden (rmeden)
Date: 2009-07-25 07:42
Message:
Take a look at some of the other scrapers for ideas... Personally, I like
using HTML::TableExtract and WWW::Mechanize.
The site terms often have a restriction against redistribution of the
data. If that's the case, you may not be able to legally generate an XMLTV
file from a local database and distribute it. (Local copyright laws vary,
of course.)
The redistribution restriction is why most (all?) grabbers hit the data
source directly.
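
For anyone looking for the XMLTV format sample asked about in this thread, here is a minimal sketch in Python rather than Perl. It only shows the basic shape of an XMLTV file (a `tv` root, `channel` elements, then `programme` elements with `start`, `stop` and `channel` attributes). The channel id "2.example", the generator name and the sample programme are made up for illustration, and a real grabber would need to follow the full XMLTV DTD.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timedelta, timezone


def xmltv_time(dt):
    """Format a timezone-aware datetime as XMLTV expects: YYYYMMDDhhmmss +hhmm."""
    return dt.strftime("%Y%m%d%H%M%S %z")


def build_xmltv(channels, programmes):
    """Return an XMLTV document as a string.

    channels:   list of (channel_id, display_name) tuples
    programmes: list of dicts with keys channel, title, start, stop
    """
    # "example-sketch" is a placeholder generator name, not a real grabber.
    tv = ET.Element("tv", {"generator-info-name": "example-sketch"})

    # All <channel> elements come before the <programme> elements.
    for chan_id, name in channels:
        chan = ET.SubElement(tv, "channel", id=chan_id)
        ET.SubElement(chan, "display-name").text = name

    for p in programmes:
        prog = ET.SubElement(
            tv,
            "programme",
            start=xmltv_time(p["start"]),
            stop=xmltv_time(p["stop"]),
            channel=p["channel"],
        )
        ET.SubElement(prog, "title", lang="es").text = p["title"]

    return ET.tostring(tv, encoding="unicode")


# Hypothetical sample data; the UTC offset is only an example.
tz = timezone(timedelta(hours=-6))
sample = build_xmltv(
    [("2.example", "Canal 2")],
    [{
        "channel": "2.example",
        "title": "Noticiero",
        "start": datetime(2009, 7, 25, 20, 0, tzinfo=tz),
        "stop": datetime(2009, 7, 25, 21, 0, tzinfo=tz),
    }],
)
```

Building the tree with an XML library instead of string concatenation keeps titles with characters such as `&` properly escaped.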
----------------------------------------------------------------------
Comment By: Mario Medina Nussbaum (medisoft)
Date: 2009-07-25 03:25
Message:
I think I'm going to write some kind of parser that extracts the info from
this page and stores it in a local database, and this database will be the
source for XMLTV, to avoid sending lots of requests to the provider's service.
I know PHP and Linux very well. Do you have any advice for me on how to do this
(like an XMLTV format sample or methods to extract the information)?
Thanks
----------------------------------------------------------------------
Comment By: Mario Medina Nussbaum (medisoft)
Date: 2009-07-25 02:48
Message:
I want to add that enlatele.com.mx doesn't have anything in their terms about
that: nothing about robots or automated extraction processes. Being an
HTML source that covers ALL the TV providers in Mexico, I think it is the best
source at the moment.
----------------------------------------------------------------------
Comment By: Mario Medina Nussbaum (medisoft)
Date: 2009-07-25 02:31
Message:
I found another source, http://enlatele.com.mx.
They use plain HTML, so it should be possible to extract the info easily.
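
As a rough illustration of pulling listings out of plain HTML, here is a small Python sketch using only the standard library. The actual markup of the site is not described in this thread, so the sample table below (time, channel, programme columns) is purely hypothetical; a real page would need its own selectors and error handling.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text chunks of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (e.g. whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # drop any cell left unclosed in this row


def extract_rows(html):
    """Return a list of rows, each a list of cell strings."""
    parser = TableRows()
    parser.feed(html)
    parser.close()
    return parser.rows


# Hypothetical markup, NOT taken from any real listings site.
SAMPLE = """
<table>
  <tr><th>Hora</th><th>Canal</th><th>Programa</th></tr>
  <tr><td>20:00</td><td>Canal 2</td><td>Noticiero</td></tr>
  <tr><td>21:00</td><td>Canal 2</td><td>Pel&iacute;cula</td></tr>
</table>
"""

rows = extract_rows(SAMPLE)
```

The first row holds the column headers, so a grabber would typically skip `rows[0]` and map the remaining rows to programme entries.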
----------------------------------------------------------------------
Comment By: Robert Eden (rmeden)
Date: 2009-07-23 07:48
Message:
Is there anything in the site's terms of service to preclude a scraper?
If not, the next step is to find someone (presumably in Mexico, who wants
the listings) to write one. The XMLTV SF project is mostly a collective
of folks with an "itch to scratch". They write grabbers to satisfy their
needs, then share them.
We would certainly consider it for inclusion.
----------------------------------------------------------------------
Comment By: Mario Medina Nussbaum (medisoft)
Date: 2009-07-23 07:42
Message:
I also found this http://www.cablevision.net.mx/programacion.html from the
cable provider directly
----------------------------------------------------------------------