All right, I formatted the manuscript in MS Word (for the Mac), converted it to .htm and uploaded it. The preview screen shows all of the apostrophes and dashes as odd characters instead of the appropriate punctuation. When I download the html file and open it, those odd characters are gone and the proper punctuation returns. Which of these screens do I believe and what do I do to fix it?
This is often caused by the word processor converting things like curly quotes into numeric entity references rather than named character entity references. In the HTML source, both kinds appear as a short code starting with & and ending with ; but DTP doesn't like the ones that have numbers in between (such as &#8220;).
Look through your HTML for the characters that aren't converting properly, then find the matching "correct" character entity in the "Entity" column at http://htmlhelp.com/reference/html40/entities/latin1.html. Then do a search and replace in your document (make sure to back up your HTML file first, though!).
Thanks, Kreelan. Would that I knew how to DO any of that. Are you talking about fixing characters that appear dozens of times per page in a 200 page manuscript by poring through the html coding page by page? I'm dizzy.
Well, if you want, feel free to send a copy of your HTML to me at sword @ kreelanwarrior.com and I'll take a look at it (note: it may take a few days, though, and don't be bashful about pinging me - I'm easily distracted!).
You can open the html in a plain text editor (maybe even Word) and use search & replace. Replace &#8220; with &ldquo; for example.
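For a 200-page manuscript, the numeric-to-named replacement can be scripted instead of done one code at a time. Here is a minimal sketch in Python, not a tested DTP workflow. The function name numeric_to_named is made up for illustration; the lookup table codepoint2name comes from Python's standard html.entities module and covers the HTML 4 entity names. Codes with no named equivalent are left untouched.

```python
import re
from html.entities import codepoint2name  # HTML 4 code point -> entity name

# Matches decimal (&#8220;) and hexadecimal (&#x201C;) numeric references.
_NUMERIC_REF = re.compile(r"&#(?:[xX]([0-9a-fA-F]+)|([0-9]+));")


def numeric_to_named(html_text):
    """Replace numeric references with named ones where a name exists."""

    def swap(match):
        hex_digits, dec_digits = match.groups()
        codepoint = int(hex_digits, 16) if hex_digits else int(dec_digits)
        name = codepoint2name.get(codepoint)
        # Keep the original reference if HTML 4 defines no name for it.
        return f"&{name};" if name else match.group(0)

    return _NUMERIC_REF.sub(swap, html_text)
```

To use it, read the backed-up .htm file into a string, pass it through numeric_to_named, and write the result to a new file. The encoding to open the file with depends on how Word saved it, so treat that as something to check rather than assume.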
Is it nuts that the numerical "entities" won't work? Yes, it's nuts.
It may, however, be worse than that. Word probably saved your curly quotes etc. as the actual symbols, so that when you look at the file in Notepad, for example, what you see is the curly quotes themselves. Now you've got to replace “ (the opening curly quote) with &ldquo; and do the same for the closing quote, the apostrophes and the dashes.
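If the file does contain the literal symbols, the same search and replace can be done in one pass. This is a rough sketch in Python under stated assumptions: the table below lists only the punctuation Word most commonly substitutes (curly quotes, dashes, ellipsis), and the names PUNCTUATION_ENTITIES and literal_to_named are made up for illustration. Extend the table if other characters show up as odd symbols in the preview.

```python
# Literal "smart" punctuation mapped to HTML 4 named entities.
# The selection is an assumption about what Word typically inserts.
PUNCTUATION_ENTITIES = {
    "\u2018": "&lsquo;",   # left single quote
    "\u2019": "&rsquo;",   # right single quote / apostrophe
    "\u201c": "&ldquo;",   # left double quote
    "\u201d": "&rdquo;",   # right double quote
    "\u2013": "&ndash;",   # en dash
    "\u2014": "&mdash;",   # em dash
    "\u2026": "&hellip;",  # ellipsis
}

# str.maketrans accepts a dict of single characters to replacement strings.
_TABLE = str.maketrans(PUNCTUATION_ENTITIES)


def literal_to_named(text):
    """Replace literal curly quotes, dashes and ellipses with named entities."""
    return text.translate(_TABLE)
```

This only works if the file is read with the encoding it was actually saved in; if the symbols already look wrong after opening the file in Python, the encoding guess is the first thing to revisit.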
I gave up publishing through Mobipocket because Mobi Creator didn't recognize the numerical "entities". Now I know better.
Google something like this: curly quotes html character entities
1: document began life in MS Word, formatted with page numbers, paragraph indents, bold-faced chapter headers, and so on.
2: tried converting it as an MS doc, as a .pdf, as an .htm. The last was the closest to looking okay, but it had the funky characters described at the beginning of this.
3: downloaded Stanza, which read the html version of the document fairly well. No more funky characters, but also no more boldface, and it has pulled "Introduction" and "by Trial User" from the html coding. All my paragraph white space is gone.
4: Stanza does allow me to save the document as a .azw, which is great except all of those errors come through in the preview doc on the Kindle page.
5: when I open the html version(s) of the document, I can use the "see source code" to look at the source code behind it, but I can't CHANGE anything. So the "trial user" space holder is in there and I can't do anything but wave at it.
All I want is to have the document look a little bit like the one I've spent the last four days formatting and editing, with chapters bold-faced, my name showing up in the right place and everything else. Nothing I've tried so far has gotten me close to that.
I gave that a try too, actually. The html is closer, but...
Okay, at this point all I want to be able to do is retain some of the white space, and all of the boldface, italics, and underlining, and I can't see any trick that does that. I can't get at the html coding to edit it in TextEdit, as far as I can tell.
I have a book which has been self-published by Bascom Hill Publishing Group, Minneapolis, MN. The publisher tells me that the file of the book is in PDF format.
What is the cost that Amazon charges to convert this PDF file into a file so that Amazon can publish my book in e-book form? I understand that PDF does not convert well to the e-book form preferred by Amazon.
My book is listed on Amazon.com and is 184 pages long. It is called "Surviving High Society".
I have a silly little problem, but it's driving me crazy. I convert my MS Word doc to HTML and upload that to DTP. Looking at the preview, everything is fine EXCEPT that a small indent appears at the beginning of every paragraph that is not supposed to have one (for example, the beginning of a new chapter, the beginning of a new section, the copyright notice). I don't have a Kindle, but a friend tells me that these small indents are indeed in the Kindle version of my book. I have thoroughly checked my Word doc and the HTML version, and neither has these tiny indents. I have checked the Word formatting and can see no reason why these indents should materialize when DTP converts the file. Has this happened to anyone else? Can you tell me how to fix it? Thanks!