Topic: Word text to HTML (Page 1 of 1) |
Paranoid (IV) Inmate From: Mpls, MN |
Ok, I've got a stupid question. |
Paranoid (IV) Inmate From: Norway |
HTML Tidy for the win! |
Paranoid (IV) Inmate From: Mpls, MN |
Thanks, |
Paranoid (IV) Inmate From: Norway |
Last time I tried, i.e. years ago, Dreamweaver did a good job of removing the crap in Word's markup. But HTML Tidy is supposed to do it very well too. It's worth googling for examples of how to do it. |
Paranoid (IV) Inmate From: Florida |
You just need some sed(/ish) scripts, or an editor that does macros, etc. If Word can save as HTML and the only problem is that decent HTML is surrounded by tons of crap HTML, it might be best to go that route - save as HTML from Word, then run it through a script/macros that clean out the crap. |
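For anyone who wants to try the script route, here is a rough sketch of what a handful of sed-style substitutions might look like in Python. The rule list and the name `clean_word_html` are made up for illustration, and the patterns only cover the usual suspects in Word exports (conditional comments, `<o:p>` tags, `Mso*` classes, inline styles), so expect to add rules for your own documents.

```python
import re

# Each (pattern, replacement) pair is one sed-style substitution.
# The list is illustrative, not exhaustive.
WORD_CLEANUP_RULES = [
    # Conditional comments such as <!--[if gte mso 9]> ... <![endif]-->
    (re.compile(r"<!--\[if[^\]]*\]>.*?<!\[endif\]-->", re.DOTALL | re.IGNORECASE), ""),
    # Office namespace tags such as <o:p>, </o:p>, <w:...>, <v:...>
    (re.compile(r"</?[ovw]:[^>]*>", re.IGNORECASE), ""),
    # class="MsoNormal" and friends (quoted or unquoted)
    (re.compile(r'\s+class="?Mso\w*"?', re.IGNORECASE), ""),
    # Double-quoted inline style attributes
    (re.compile(r'\s+style="[^"]*"', re.IGNORECASE), ""),
    # Spans left without attributes: keep the text, drop the tag
    (re.compile(r"<span>(.*?)</span>", re.DOTALL | re.IGNORECASE), r"\1"),
]


def clean_word_html(html):
    """Apply every cleanup rule in order and return the result."""
    for pattern, replacement in WORD_CLEANUP_RULES:
        html = pattern.sub(replacement, html)
    return html


if __name__ == "__main__":
    sample = '<p class="MsoNormal" style="margin:0in"><span style="mso-bidi:x">Hello</span><o:p></o:p></p>'
    print(clean_word_html(sample))
```

Regexes are not an HTML parser, so nested spans and single-quoted attributes will slip through; treat this as a first pass before something like Tidy.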
Paranoid (IV) Inmate From: Mpls, MN |
I got Tidy working perfectly now. I found that if I export the doc as HTML from Word and then run it through HTML Tidy twice, I get exactly what I was after. WeBuilder 2007 has a very nice front end to Tidy, but TidyGui would do the trick as well. |
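If you would rather script the two passes than click through a front end, something along these lines should work. It assumes the `tidy` command-line tool is installed and on your PATH; the option set (`--word-2000`, `--bare`, `--clean`) is my guess at a reasonable starting point, not the settings used above, and the function names are invented for the example.

```python
import subprocess


def tidy_command():
    """Build an HTML Tidy command line aimed at Word exports (assumed options)."""
    return [
        "tidy",
        "-quiet",
        "-asxhtml",
        "--word-2000", "yes",   # strip Word-specific markup
        "--bare", "yes",        # replace smart quotes, em dashes, nbsp
        "--clean", "yes",       # replace presentational tags with CSS
        "--show-warnings", "no",
    ]


def run_tidy(html):
    """Pipe html through tidy once and return the cleaned markup."""
    result = subprocess.run(
        tidy_command(), input=html, capture_output=True, text=True
    )
    # Tidy exits with 0 (ok), 1 (warnings) or 2 (errors).
    if result.returncode > 1:
        raise RuntimeError(result.stderr)
    return result.stdout


def tidy_passes(html, passes=2, run=run_tidy):
    """Feed the output of each pass back in as the input of the next."""
    for _ in range(passes):
        html = run(html)
    return html
```

Usage would be roughly `tidy_passes(open("export.htm").read())`, where `export.htm` is whatever Word saved. The `run` parameter is only there so the loop can be tried without Tidy installed.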
Paranoid (IV) Inmate From: Florida |
Twice, ha. Never tried that; neat. |
Obsessive-Compulsive (I) Inmate From: |
I wouldn't trust a software package with a ten foot pole for this. But coming from a Search Engine Optimization background, I like my content to have as little markup as possible. If I am using a WYSIWYG editor, I copy and paste into Textpad first, then build the page. If it's Dreamweaver, I paste right into the source view and wrap it in tags as I go. I find it really doesn't take that much time, with the exception of tabular data. Cheers. |
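The paste-as-plain-text approach can be partly automated too. A minimal sketch, assuming the text was pasted with blank lines between paragraphs; `paragraphs_to_html` is a made-up helper name, and it only handles paragraphs, so tables are still on you.

```python
import html
import re


def paragraphs_to_html(text):
    """Wrap each blank-line-separated block of plain text in <p> tags."""
    blocks = re.split(r"\n\s*\n", text.replace("\r\n", "\n"))
    paragraphs = []
    for block in blocks:
        # Collapse line breaks and runs of spaces inside a paragraph.
        flat = " ".join(block.split())
        if flat:
            paragraphs.append("<p>{}</p>".format(html.escape(flat)))
    return "\n".join(paragraphs)


if __name__ == "__main__":
    print(paragraphs_to_html("Hello & welcome\n\nSecond\nline"))
```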