So, if you were to try to use
nikola_wordpress_importer from master now, it would:
Not crash ;-)
Fix links to attachments so they work on the new site
However, I am now unsure of what exactly is in wordpress.com's export XML file. The posts themselves are in this form:
Muchas gracias Nico por hacer el video este. Groso, quedó muy bueno. [youtube=http://www.youtube.com/watch?hl=es&v=882qxARXa6c]
Two things jump to me:
That's not HTML
WTF is that youtube thing?
I am having some success processing it as markdown, since that handles the paragraph breaks and some other stuff. Maybe the youtube embedding is done with a markdown extension?