SourceForge.net
2011-06-21 19:39:34 UTC
Bugs item #3324199, was opened at 2011-06-21 20:39
Message generated for change (Tracker Item Submitted) made by user1024
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=424135&aid=3324199&group_id=39046
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: tv_imdb
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: user1024 (user1024)
Assigned to: Jerry Veldhuis (jveldhuis)
Summary: Character encoding of xml input not followed
Initial Comment:
I have an xmltv file with utf-8 encoding. When I applied tv_imdb to this file an actor tag was added but the actors name was not encoded in utf-8. Firefox gives an "XML Parsing Error: not well-formed" page when I try to open the output xmltv file.
Attached is a patch of a work around which when applied generates an output file which will open without error in Firefox. This patch is intended to illustrate the problem not to provide a general solution. I expect that the same issue applies to other tags not just the author tag.
I am unsure at which stage the issue should be addressed. I imagine that the encoding could be changed when the imdb database files are created, when text is read from the database file or when the text is inserted into the output structure, or is it a problem with the imdb database files? etc.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=424135&aid=3324199&group_id=39046
Message generated for change (Tracker Item Submitted) made by user1024
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=424135&aid=3324199&group_id=39046
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: tv_imdb
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: user1024 (user1024)
Assigned to: Jerry Veldhuis (jveldhuis)
Summary: Character encoding of xml input not followed
Initial Comment:
I have an xmltv file with utf-8 encoding. When I applied tv_imdb to this file an actor tag was added but the actors name was not encoded in utf-8. Firefox gives an "XML Parsing Error: not well-formed" page when I try to open the output xmltv file.
Attached is a patch of a work around which when applied generates an output file which will open without error in Firefox. This patch is intended to illustrate the problem not to provide a general solution. I expect that the same issue applies to other tags not just the author tag.
I am unsure at which stage the issue should be addressed. I imagine that the encoding could be changed when the imdb database files are created, when text is read from the database file or when the text is inserted into the output structure, or is it a problem with the imdb database files? etc.
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=424135&aid=3324199&group_id=39046