rst2rst works (80% or so)

2006-11-02 23:20

What is it? A program that takes a docutils document tree ( parsed from a RST document or programatically generated) then dumps as close as I can guess to reasonable RST back.

This lets Restructured Text be a saveable data format, which is nice.

It's not done as a docutils writer. Sorry, I couldn't make that work.

What works? Most of it.

What doesn't? A dozen directives, custom interpreted text roles, and tables.

Yes, all of those are important. But the rest seems to work ok!

Look: a 804 line RST document containing almost every feature of the language, and the only difference in the generated HTML output between the original and rst2rst's is an invisible difference in continuation lines in line blocks.

[ralsina@monty wp]$ python rst2rst.py t1.txt > t2.txt
[ralsina@monty wp]$ /usr/bin/rst2html.py t1.txt t1.html ;  /usr/bin/rst2html.py t2.txt t2.html
[ralsina@monty wp]$ diff t1.html t2.html
468,469c468,469
< <div class="line">But I'm expecting a postal order and I can pay you back
< as soon as it comes.</div>
---
> <div class="line">But I'm expecting a postal order and I can pay you back</div>
> <div class="line">as soon as it comes.</div>
[ralsina@monty wp]$ wc -l t1.txt
804 t1.txt

You can get rst2rst.py and the testfile.

Anyone knows of a real docutils test suite I could borrow?

Hacking Restructured Text

2006-10-29 15:59

I am a great fan of Restructured Text. I write my blog using it. I write my business proposals using it, I write my documentation using it, I think you should write almsot everything you write now using it. I have even blogged many times about it.

RST is a minimal markup language. You can figure it out in a couple of hours, and then use it to produce pretty HTML pages, PDF docs, man pages, LaTeX documents, S5 slides, and other things.

Plus, the source works as a plain text version, and is very readable:

This is a title
===============

Some text in a paragraph

A subtitle
----------

* A list

* More items

  1. A numbered sublist

  2. Another item

     a) A sub-sub-list

     b) With more items


+-----------------------+-------------------------+
|   A table             | With two columns        |
+-----------------------+-------------------------+
|  And Two              |   rows                  |
+-----------------------+-------------------------+

See? Nice.

RST has another great thing that is not so well known: there is a parser for it, which turns the document into a tree of nodes reppresenting different parts of the document.

You can manipulate this node tree, modifying the document, and then generate the output.

But there is no way, right now, to generate RST from the tree. Which means it's a one way road.

Well, I am hacking to fix that.

Right now, I handle titles, sections, all sorts of lists, transitions, quotes, emphasis, italics, and a few other elements.

The only ones that seem difficult to implement are tables, but I still think I can do it. Although the produced RST doesn't look the same as the original, it is functionally identical.

How do I test if it works? With a test suite. If it works, it should be invariant this way:

RSTsample -> rst2html produces the exact same output as RSTsample -> rst2rst -> rst2html

If anyone wants a copy, email me.

Some people say anything

2006-10-25 16:01

Last night I saw an "investigative news" program on the TV. It's called "Informe Central", and their headline story was about an abandoned factory in San telmo (where tourists go to see typical BA and locals go to see tourists).

The thing is, that factory has been taken over by poor people who live there. It's conveniently located, and they don't pay anything.

On the other hand, it's a nest of drug, rape, poverty and violence, but that's not the only thing these "journalists" said.

They said they lived in inhumane conditions, up to 2.6 persons per square meter.

They also said about 300 people live there, which would mean there are roughly 115 square meters in the factory.

Which is, actually, closer to 1200 square meters. or maybe 5000. But they kept on saying those numbers.

Do you know that in order to have 2.6 persons per square meter so that each of them has a small (double) bunk bed, you would have to put the bunk beds one next to the other with 20cm-wide spaces in between?

How the hell did they get that number? Is that a sign of their regular investigative quality? Probably.

An application idea

2006-10-24 09:37

Yesterday I wrote that I have too many ideas. Ok, here's another one:

A word processor for writers. And when I say writers, I mean novelists, technical book writers, script writers, playwrights...

Word is not very good for a writer. OpenOffice is not good. KWord is probably worse (because of the emphasis on page layout). LyX is probably as good as it gets, and it's not exactly perfect.

A writer actually needs a simple-ish word processor with a bunch of ancillary gadgetry.

For example:

Statistics:
- How many words/chars/pages a day is he writing
- A live word/char counter
- A live word frequency monitor (put the cursor on a word and see how often it's used)
- Live counter of document/chapter/section/scene size.
Outlining
- Real live outlining. The kind where you drag stuff around and the text follows.
- An editable full-text outline view
Collaboration
- Multiple editors
- Versioning control
Projects
- Multiple files per project
- Linking files to places on the text in other files
Index cards
- Associating index cards to places on the text
- Grouping index cards (for example, per character, or per location)
- Placing them on a timeline or a storyboard
Live Thesaurus / Dictionary
- Show definitions and alternatives as the pointer crosses a word.
- One click replacement
Styling
- Per fragment/paragraph styles
- User defined
- Predefined styles

There are a bazillion things he does not need, though, like detailed page layouting, or grammar checking.

It would be nice if it could later be easily imported (styled!) into something like Scribus so a decent page layout could be done, but it doesn't need to be in the same app at all.

The text engines in Qt4 are good enough for all this app needs graphically.

RestructuredText is good enough to provide a backend, a parser, an exporter, a reader, a transformer, whatever.

So there it is, another idea I will most likely not implement. Someone please run with it, you can probably make it a rather expensive GPL shareware on Mac ;-)

Wifi dongle

2006-10-23 17:54

Bought an Eusso (No, I had never heard of them either) Wifi USB dongle.

Why?

It says "linux driver" on the blister
It's the cheapest 802.11g thing on the local ebay-like place
My ancient pcmcia 802.11b card sucks.

I am thinking of buying half a dozen more and getting rid of all the cables for all my boxes, all of Rosario's office and the guest computers (yes, I do have guest computers. They are there so my guests have their own computers :-).

Plugged it and it worked (ok, I had to install the zd1211 driver which took me 40 seconds). Only problem: it's hot. HOT.

So, need a USB/WiFi thingie that works well in Linux? You can do worse than this baby.

Ralsina.Me — Roberto Alsina's website