This isn't a blog and the content here is not blog posts. Rather, these are random notes I've taken and chosen to make public for ease of reference. If you stumble across them and find them helpful, that's great; however, compelling reading and thoughtful prose are not to be found here.

Using Pandoc with Problematic UTF-8 Files

I recently used pandoc to convert some HTML files to reST. Initially, there were numerous errors about invalid characters. Here is the brute-force solution I arrived at to get the job done while converting the offending characters into something reasonable.
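
The details are in the full note. As a rough illustration only, one way to turn invalid bytes into something readable before handing a file to pandoc might look like the sketch below. The function name clean_utf8 and the choice of Windows-1252 as the fallback encoding are assumptions made for this example, not something taken from the note.

```python
import codecs


def _cp1252_fallback(err):
    """Error handler: reinterpret undecodable bytes as Windows-1252 (an assumed fallback)."""
    if not isinstance(err, UnicodeDecodeError):
        raise err
    bad = err.object[err.start:err.end]
    # Bytes that are undefined in cp1252 become U+FFFD instead of raising.
    return bad.decode("cp1252", errors="replace"), err.end


# Register the handler under a hypothetical name so bytes.decode() can use it.
codecs.register_error("cp1252_fallback", _cp1252_fallback)


def clean_utf8(raw: bytes) -> str:
    """Decode as UTF-8, falling back to cp1252 for any bytes that are not valid UTF-8."""
    return raw.decode("utf-8", errors="cp1252_fallback")
```

The cleaned text can then be written back out as UTF-8 and converted as usual, for example with pandoc -f html -t rst.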

Read more…

Converting MS Word DOCX Document to reStructuredText

A client has a substantial body of written work that began life as MS Word documents. They also created many derivative documents in various formats (including HTML, PDF, Dreamweaver, MS FrontPage, etc.) for different use cases. Our recommendation was that, going forward, they should create content once, in a canonical format, from which they could generate documents in whatever format they need (HTML, PDF, etc.). That subject is beyond the scope of this note. This note focuses on a quick and dirty method to access their assets in MS Word format and convert them into something more widely useful.

Read more…

Satchmo Questions

This note grew out of questions that went unanswered as I pushed through my first Satchmo install. Or, more accurately, it documents the questions that I had and still have. Rather than let them clutter and muddy my install notes, I have moved them here.

Read more…

Colophon

This site is built with the UIkit framework, the Nikola static blog/site generator, and its associated Lanyon theme. I particularly enjoy using Nikola and find the Lanyon theme to be clean, simple, and easy to read.

Read more…