You still have not "cleaned" your MS of all the bad code. As people have said before, either copy and paste the entire manuscript into Notepad and save it as a text (.txt) file to strip absolutely everything and leave the plain text, or do as NotJohn said and copy and paste the Word .doc into the window at the WordtoCleanHTML page. Then copy the resulting HTML and past THAT into Notepad and save it as an HTML file (book.html for instance). Then open that HTMl file with Word and save it as a Word .doc. If you choose this latter route, you'll retain all your italics, underlines, and boldings.
BEFORE YOU DO ANY OF THAT, try simply saving your Word file as Rich Text Format (.rtf) because that will usually remove all stray formatting and retain the italics etc. too. Then shut down Word, re-open Word, re-open that RTF file, and save it as a Word .doc and reformat it.
Why are you trying to save as an XML file anyway? The only XML files in an eBook are the OPF and NCX except for a third XML file needed for ePubs uploaded to Apple. You can't upload XML files to KDP anyway.
I wouldn't save and upload a .docx file but would stay with .doc instead. That is, HTML is always the best way to go, but if you don't know HTML and CSS, then stay with a .doc file.
You've already given everyone here enough information to at least tell you what the issues are, but unfortunately, I guess they're not paying attention.
Quotations for dialogue all start with A instead of open quotes > " and @ instead of closed quotes > " apostrophes show as > = . Very weird
...my first question would have been: did you start this book in Wordperfect? Any commercial formatter should instantly recognize those. It's endemic to files that started their lives in Wordperfect.
But, then you said:
(I used to write in WordPerfect which can show you ALL the codes)
...which is your problem. WP encodes the characters you're using differently than does Word. That's why you have all the symbols, instead of the proper marks. Are you able to edit in HTML, so that you can remove the cruft and replace it with properly-encoded characters? If you can, open the file in an HTML editor, look for the first instance of the @ or the other symbols you're seeing, and then do a search-replace, to put the correct quotation marks back IN the file. If memory serves, using the (absurd) "nuclear" formatting option won't solve your encoding issues.
Nobody bothered to mention, when you said this:
changed font to Calibri which they ay is the default font. Already tried Times New Roman and Arial
...that it doesn't matter WHAT font you choose. That's not what is causing the issue, and Kindle will ignore your font selections anyway unless you're creating books in HTML, and even then, most people don't know how to embed fonts and get them to work. Your problem is the characters that were inserted by Wordperfect, not the font.
Word doesn't offer me an option to save the book in HTML
What version of Word are you using? Word's had that capability since 2003, at least. Are you on a Mac, too, perchance? (Double-trouble, if that's the case). You're not using Works, or something like that, are you? Or, did you use a "save as Word" from within Wordperfect? If you did, that won't work, as you are now seeing.
Find the "save as," choose Web, filtered, if you have it, and then if you know how, as you intimated earlier, do search-replace for all the junk. That will fix it. And don't write in Wordperfect again, unless you use an updated version, and go from Wordperfect-->HTML-->HTML editor-->Cleanup-->KDP.
when I look at the pages, from Location 1 (of 7605) it started with my Cover image goes to location 4 for Tittle Page, then 15, 29, 39 48, 51 (my TOC but it is blank after heading) , and next to 56 which is blank, then I step up through every blank page 1 by 1 before getting to location 131 which is the start. of the novel.
Does anyone know why there are so many skips and blank pages? How can I try to get rid of them?
First, those aren't page numbers, they are LOCATIONS, which is how the Kindle devices indicate at what point a reader is, in a book. eBooks don't have pages, and thus, don't have page numbers. (Go here to our Micro-FAQ, and look at the article about Page numbers: http://www.booknook.biz/faq/ebook-micro-faq/ ). So, the Locations don't go 1-2-3, etc., they appear to "jump" in number. With regard to "blanks," I'd have to see your book to understand what's happening, but you ought not have blank pages, not in an eBook that's properly made. All I can infer from what you're saying is that you might have used page-breaks or section-breaks in places (or worse, at the bottom of each page!) that are then in turn causing the faux-blank-pages. If you do have page-breaks at the bottom of each page, remove all of them that are in the BODY of the story, that are not the end of a chapter. You can keep page breaks between pages or sections of front-matter (like the Copyright page, dedication, Title page, etc.)
You have a fair amount of work ahead of you, but if you can indeed work in HTML, you should be fine. Best of luck to you.
We produce ebooks
Listed as an Amazon Professional Conversion Service: http://bit.ly/uFwMwb
An INScribe Preferred Conversion Partner http://www.booknook.biz/
Follow me on Twitter: @BookNookBiz
"If you think it's expensive to hire a professional to do the job, wait until you hire an amateur."
Dahlia, I know you said you wrote the thing in Word, but your problem sounds exactly like the one I had when I moved files from Word Perfect into Word. As Booknook said, it's endemic to files that started their lives in Wordperfect.
In every single file I moved from WP, quotation marks and apostrophes went to pot. I discovered that, although I’d written in TNR, quotation marks and apostrophes had switched to some strange font. Find and replace didn’t work, and I ended up painstakingly going through the documents and changing every damn one of those symbols to TNR.
I THINK I might have my problem in hand, but I must retype my whole manuscript back into WORD. So far some testing has shown that it SHOULD work.
Keeping my fingers crossed. It will take me all week.
What you should do before you do anything else is to take your original WP manuscript and find somebody with a copy of Word to convert it to a .doc file. DON'T try to convert it to .doc from within WP. It does a terrible job of it.
I must retype my whole manuscript back into WORD. So far some testing has shown that it SHOULD work.
Yes Dahlia, you could take all week to do that. Or you could go the much less time consuming route I suggested of changing the non TNR symbols to TNR. It took me about an hour to do that with a 90,000 word file, and never mind testing to show it SHOULD work, I can assure you doing what I've described DID work.
Thanks for all the interest and help. I think many of you are correct. The file WAS in WordPerfect long ago, yet I had converted it once already, and rewritten lots since then, also did other things to get those WP codes out. And into Word 2007. So yes I am retyping the whole thing, (120K words) back into it.
I have not dared cut and past ONE WORD, so even though it is a supreme effort and killing my back and eyes, it is working, and at least I am rewriting some as I go, so overall, I believe the book will be better when done.
PS I will be back because I am having to make my TOC after this is done.
Hi, I am trying to get my new novel ready. I have the same problem I had before with the dialogue marks. - - - "I'll go." in dialogue
becomes AI>ll go.@ - - -
It makes the " ' and " into these other things?
I have corrected it before but not sure how I did it then???
Hello. I had to type the whole manuscript (Acts of Empathy) all over again into WORD. Took me 4 weeks. lol And I could not use Tab for indents, only did 5 spaces for every one. I was told Tab never works for indents.
Anyway it is all in there now yet even though I use the 5 spaces for my indents, each time I try to review to check typos etc, there always seems to be several that have changed position and are now maybe 4 or 6 spaces indented. Also a lot of double spaces between words somehow.
Wondering if there is any way without relearning my childhood to make things uniform for Kindle???
You shouldn't put your Word file into the downloadable previewer, Dahlia. Rather, upload the Word file, then download the resulting converted file (called a mobi for short, though the reality is more complicated). Load the mobi into the previewer. Good luck!
You are right that tabs don't work for indents. But neither does spacing with the space bar.
What you want to do is to use Styles, which you really should have used all along. At this point I fear giving any advice that would lead to you trying to retype again (which you really didn't need to do the first time, though if you improved your book in retyping that's a plus). So here's a fast-and-dirty way to fix the indents.
First, make sure the pilcrow is turned on. It's a symbol that looks like a backwards letter P. You should see it in the ribbon at the top of your Word document. You can tell if it's turned on if there's a "backwards P" at the end of every paragraph, and if spaces between words appear as faint dots. Click it to turn on if not.
Now go through your manuscript and delete every single space-bar space you used to indent paragraphs. While you're there, you can see if there are extra spaces between words. This will take an hour or two.
Now, for each chapter, highlight the whole thing, first through last paragraph (you can leave the headings as they are). Go up to the Format menu at the top of the page. Choose "paragraph." It will give as one of the choices "first line" with the option of "first line special." Choose that, and indicate an indent of 0.3 inches. Click OK, and all the paragraphs in your chapter will now have a decent indent that won't wander around. Do the same for each chapter.