Venus Error with Unicode feed

Zack zajmal at hotmail.com
Sat Sep 15 06:05:24 EST 2007


Hi,

I run an Urdu Planet comprising blogs written in Urdu language. I am
thinking of switching to Venus from Planet 2.0.

I ran runtests.py and it ran ok. However, when I tried to run planet with my
config.ini I got errors. To figure out what was happening I removed all
feeds from the config except this one:

http://www.zackvision.com/weblog/urdu.xml

When I run "python2.4 planet.py urdu/config.ini", I get this error:

INFO:planet.runner:Socket timeout set to 20 seconds
INFO:planet.runner:Building work queue
INFO:planet.runner:Updating feed http://www.zackvision.com/weblog/urdu.xml
ERROR:planet.runner:Error processing
http://www.zackvision.com/weblog/urdu.xml
ERROR:planet.runner:UnicodeDecodeError: 'ascii' codec can't decode byte 0xd8
in position 8: ordinal not in range(128)
ERROR:planet.runner:  File
"/home/.bartholimew/urduwebo/dev.urduweb.org/venus/planet/spider.py", line
433, in spiderPlanet
    writeCache(uri, feed_info, data)
ERROR:planet.runner:  File
"/home/.bartholimew/urduwebo/dev.urduweb.org/venus/planet/spider.py", line
205, in writeCache
    output = xdoc.toxml().encode('utf-8')
ERROR:planet.runner:  File "/usr/lib/python2.4/xml/dom/minidom.py", line 47,
in toxml
    return self.toprettyxml("", "", encoding)
ERROR:planet.runner:  File "/usr/lib/python2.4/xml/dom/minidom.py", line 62,
in toprettyxml
    return writer.getvalue()
ERROR:planet.runner:  File "/usr/lib/python2.4/StringIO.py", line 271, in
getvalue
    self.buf += ''.join(self.buflist)
INFO:planet.runner:Loading cached data
Traceback (most recent call last):
  File "planet.py", line 68, in ?
    splice.apply(doc.toxml('utf-8'))
  File "/usr/lib/python2.4/xml/dom/minidom.py", line 47, in toxml
    return self.toprettyxml("", "", encoding)
  File "/usr/lib/python2.4/xml/dom/minidom.py", line 59, in toprettyxml
    self.writexml(writer, "", indent, newl, encoding)
  File "/usr/lib/python2.4/xml/dom/minidom.py", line 1746, in writexml
    node.writexml(writer, indent, addindent, newl)
  File "/usr/lib/python2.4/xml/dom/minidom.py", line 816, in writexml
    node.writexml(writer,indent+addindent,addindent,newl)
  File "/usr/lib/python2.4/xml/dom/minidom.py", line 811, in writexml
    _write_data(writer, attrs[a_name].value)
  File "/usr/lib/python2.4/xml/dom/minidom.py", line 303, in _write_data
    writer.write(data)
  File "/usr/lib/python2.4/codecs.py", line 178, in write
    data, consumed = self.encode(object, self.errors)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xd8 in position 12:
ordinal not in range(128)

And no output is generated.

-- 
Zack
zajmal at hotmail.com 



More information about the devel mailing list