TechZilla | September 21, 2007
I set up a new site last month at Tech-Zilla.com, the site is just a large repo of manpages. I hacked up the man2html script and then ran the output through html2xml. I automated the entire process and called the script "the Manhandler".
Now i'm thinking "look at this completely static easily index pages" ....wrong the phpscript i used for the front end had googlebot running in circles. (making guesses on url's that led to 404's) So i made alphabetical links at the head of my entrance page to the back of each apache directory listing. I mean how much easier could i make it, people in the front bots in the back...right? wrong again, seems like i got indexed a few man pages and then googlebot decided it was finished. WTF
So i made a sitemap and this better be the answer or i'm going to be furious; considering i love the xml 1.0 sleekness of the site. Never have men2html pages look so acceptable, and even better static... unlike the script intended.
Now i'm thinking "look at this completely static easily index pages" ....wrong the phpscript i used for the front end had googlebot running in circles. (making guesses on url's that led to 404's) So i made alphabetical links at the head of my entrance page to the back of each apache directory listing. I mean how much easier could i make it, people in the front bots in the back...right? wrong again, seems like i got indexed a few man pages and then googlebot decided it was finished. WTF
So i made a sitemap and this better be the answer or i'm going to be furious; considering i love the xml 1.0 sleekness of the site. Never have men2html pages look so acceptable, and even better static... unlike the script intended.
Posted 3 years, 2 months ago on September 21, 2007
The trackback url for this post is http://techzilla.biz/bblog/trackback.php/12/
The trackback url for this post is http://techzilla.biz/bblog/trackback.php/12/
Comments on this post:
Comments have now been turned off for this post