So, if you were to try to use nikola_wordpress_importer from master now, it would:
- Not crash ;-)
- Download attachments
- Fix links to attachments so they work on the new site
However, I am now unsure of what exactly is in wordpress.com's export XML file. The posts themselves are in this form:
Muchas gracias Nico por hacer el video este. Groso, quedó muy bueno. [youtube=http://www.youtube.com/watch?hl=es&v=882qxARXa6c]
Two things jump out at me:
- That's not HTML
- WTF is that youtube thing?
I am having some success processing it as Markdown, since that handles the paragraph breaks and some other stuff. Maybe the YouTube embedding is done with a Markdown extension?
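For what it's worth, the `[youtube=...]` bit looks more like a WordPress shortcode than anything Markdown would know about, so one option is to rewrite it before the Markdown pass. Here is a rough sketch of that idea, not what the importer actually does. The function names, the iframe size, and the choice to unescape `&amp;` first are all my assumptions.

```python
import html
import re
from urllib.parse import urlparse, parse_qs

# Matches the [youtube=URL] form seen in the exported post above.
YOUTUBE_SHORTCODE = re.compile(r"\[youtube=([^\]\s]+)\]")


def youtube_shortcode_to_iframe(match):
    """Turn one [youtube=URL] match into an embed iframe (hypothetical helper)."""
    # Assumption: the XML export may carry '&amp;' in the URL, so unescape first.
    url = html.unescape(match.group(1))
    video_ids = parse_qs(urlparse(url).query).get("v")
    if not video_ids:
        # No 'v' parameter found: leave the shortcode untouched.
        return match.group(0)
    return (
        '<iframe width="425" height="344" '
        'src="https://www.youtube.com/embed/{0}" '
        'frameborder="0" allowfullscreen></iframe>'
    ).format(video_ids[0])


def replace_youtube_shortcodes(text):
    """Replace every [youtube=URL] shortcode in text with an iframe."""
    return YOUTUBE_SHORTCODE.sub(youtube_shortcode_to_iframe, text)
```

Running `replace_youtube_shortcodes()` over the post body before handing it to Markdown should leave the rest of the text alone, since Markdown passes raw HTML blocks through.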