My Typo Fixed EPUB (Mises.org): https://www.mises.org/document/6995/Why-American-History-Is-Not-What-They-Say
User Submitted EPUB (if wanted for comparison)
Original EPUB (Junk Calibre Conversion)
Original PDF
Fix Notes:
This EPUB was submitted by a user to the Mises Institute. The user cleaned the original EPUB by stripping out all/most of the calibre conversion, and fixed:
- TOC
- Formatting (LOOKS VASTLY SUPERIOR)
- Footnotes
- File Structure
I went ahead and took the cleaned version and fixed hundreds of typos (mostly hyphenation issues). Here is my typo fixes from the original/fixed EPUB.
Hyphenation Errors:
2 John Dos Pas-sos
2 Germa-ny
Roth-bard
oth-ers
Epis-temology
nonfic-tion
Stim-son
forth-rightly
Vid-al
Heca-taeus
Tuck-er
invad-ers
an-archo-communists
stat-ism
Strom-berg
DiLoren-zo
histo-riographical
mega-lomaniacal
libertarian-ism
col-lege
psycho-epistemo-logical
signifi-cant
Hit-lerism
de-pradations
[r]eform-ers
crystal-ised
rea-sons
Sch-weikart
effec-tively
develop-ment
Mil-lis
sa-credness
ibertar-ianism
Apple-man
unalien-able
Hard-ing
Spacing Errors:
36 accidental around '—'
52 non-spaced "[...]"
12 spaces before footnotes
2 J ohn
J ohnathan
2 J ames
j ournalism
[v] iewed
J ohnson
J efferson
McCor-mick
r evisionist
"blue/ grey"
Wrong Italics:
15 parenthesis
189 commas
79 periods
22 quotation marks
3 books improperly italic
3 Accidental links
1 Accidental '^'
Missing punctuation:
Right before footnote 168
"Levellers of the English Civil War (16421647)"
"story of Henri de Saint-Simon (17601825)"
"Reagan's eight years (19661974)"
"Charles Austin Beard (18741948)"
Added accents:
ancien régime
naïve -> naïve
Missing blockquote:
Around Footnote 276
Beginning of Section 5 Chapter VI
Footnote Fixes:
Footnote 12 was missing "1956), p.74."
Footnote 28 link fixed
Footnote 52 link fixed
Footnote 116 had "I99o), p. i8." -> "1990), p. 18."
Footnote 141 link fixed
Footnote 238 "[yaf and SDs]" -> "[YAF and SDS]
Footnote 286 link fixed
Footnote 327 "A. of Leviathan" -> "A New History of Leviathan"
Footnote 335 Superscript problems, and some typos
Footnote 363 link fixed
Footnote 376 link fixed
Footnote 378 link fixed
TOC Fix:
First link to Chapter 3 was broken. Changed "chap1" to "chap3"
Changed to Life of Washington."
- [...] favorite reading was Parson Weems' Life of Washington!
Godkin's Nation"
- [...] Godkin's Nation' [...]
Changed to "UCV"
- United Confederate Veterans (ucv)
I also use Calibre. How is it possible to clean up the conversions?
ReplyDeleteIn order to clean up Calibre bloat, I would recommend having a basic knowledge of HTML+CSS, and Regular Expressions (not needed but helps A TON).
DeleteI stay away from Calibre as much as possible, as it should only be used when doing conversions for your own personal usage. It creates very bloated code.
I personally use Sigil for all my EPUB creating/editing, and for "non-destructive" editing I just use Notepad++ (or any text editor with Regular Expression support).
Every book that calibre converts will be slightly different, but if you take a look at the Calibre generated stylesheet, you will see Calibre inserting tens (sometimes hundreds) of different "calibre###" classes. Where all of them basically do the same exact thing.
calibre1 will have a "text-indent: 1.2em"
calibre2 will have a "text-indent: 1.22em"
calibre3 will have a "text-indent: 1.23em"
...
calibre99 will have a "text-indent:1.1589em"
calibre123 will have a "margin-top: 1em"
calibre124 will have a "margin-top: 1.01em"
...
So I just use a regular expression to remove all classes of calibre1 -> calibre99, and have my generic stylesheet have a p with "text-indent:1.2em". Then just go through the book/stylesheet to figure out what every calibre class does, and remove it from the actual book.
This cuts down the amount of classes used in the book from hundreds down to a handful.
I should definitely do an in depth post on this topic, and also an in depth post on my typical workflow.
That explains a lot. Thank you very much!
ReplyDelete