Author Topic: Local Website content to PDF?  (Read 5047 times)

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Local Website content to PDF?
« on: November 28, 2011, 12:12:08 PM »
I'm trying to archive a website. I used wget and pulled it all onto my local drive, is there a way to convert the content from each page (asp) into PDF automatically?
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

ober

  • Ashton Shagger
  • Ass Wipe
  • Posts: 14310
  • Karma: +73/-790
  • mini-ober is taking over
    • Windy Hill Web Solutions
Re: Local Website content to PDF?
« Reply #1 on: November 28, 2011, 12:14:40 PM »
That sounds like an incredibly horrible idea.

Perspective

  • badfish
  • Jackass In Charge
  • Posts: 4635
  • Karma: +64/-22
    • http://jeff.bagu.org
Re: Local Website content to PDF?
« Reply #2 on: November 28, 2011, 12:28:04 PM »

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Re: Local Website content to PDF?
« Reply #3 on: November 28, 2011, 01:17:50 PM »
Why would it be a horrible idea? I'm just trying to save the I formation, the website itself is hideous. Very geocityish.
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

micah

  • A real person, on the Internet.
  • Ass Wipe
  • Posts: 6915
  • Karma: +58/-55
  • Truth cannot contradict truth.
    • micahj.com
Re: Local Website content to PDF?
« Reply #4 on: November 28, 2011, 01:23:03 PM »
since you've wget'd all the HTML, would it be easier to just maybe re-host the site as an "archived" version. 

ie: http://georgewbush-whitehouse.archives.gov/ is snapshot of whitehouse.gov on the last day of the GWB presidency.
"I possess a device, in my pocket, that is capable of accessing the entirety of information known to man.  I use it to look at pictures of cats and get in arguments with strangers."

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Re: Local Website content to PDF?
« Reply #5 on: November 28, 2011, 01:37:52 PM »
Yea i considered that, but I was hoping to have it in PDF's for offline use, and don't feel like setting up a local server. That was actually the original plan.
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

micah

  • A real person, on the Internet.
  • Ass Wipe
  • Posts: 6915
  • Karma: +58/-55
  • Truth cannot contradict truth.
    • micahj.com
Re: Local Website content to PDF?
« Reply #6 on: November 28, 2011, 01:56:32 PM »
how many pages is it? could you just print to PDF (using your native print functionality?)
"I possess a device, in my pocket, that is capable of accessing the entirety of information known to man.  I use it to look at pictures of cats and get in arguments with strangers."

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Re: Local Website content to PDF?
« Reply #7 on: November 28, 2011, 02:04:01 PM »
About 75
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

webwhy

  • Jackass IV
  • Posts: 608
  • Karma: +15/-10
Re: Local Website content to PDF?
« Reply #8 on: November 28, 2011, 04:58:39 PM »
do you have html or asp?...big difference regarding what you're going to need to do

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Re: Local Website content to PDF?
« Reply #9 on: November 28, 2011, 05:21:53 PM »
It's asp
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

webwhy

  • Jackass IV
  • Posts: 608
  • Karma: +15/-10
Re: Local Website content to PDF?
« Reply #10 on: November 28, 2011, 05:38:36 PM »
you're going to have to interpret the asp first in order to get the content.  it will probably involve some sort of database you'll need to connect to as well.

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Re: Local Website content to PDF?
« Reply #11 on: November 28, 2011, 05:45:20 PM »
It's seeming easier to just mirror it and do the whole print to pdf nonsense.
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

webwhy

  • Jackass IV
  • Posts: 608
  • Karma: +15/-10
Re: Local Website content to PDF?
« Reply #12 on: November 28, 2011, 05:52:46 PM »
yes

Steve

  • This 49%er supports Romney
  • Just a Jackass
  • *
  • Posts: 16120
  • Karma: +31/-410
  • Mr. Mom
Re: Local Website content to PDF?
« Reply #13 on: November 28, 2011, 05:53:36 PM »
Ok, thanks webby
hey ethic if you and i were both courting lily allen..... oh wait, which one of us has a relationship that lasted more than the bus ride home?

Perspective

  • badfish
  • Jackass In Charge
  • Posts: 4635
  • Karma: +64/-22
    • http://jeff.bagu.org
Re: Local Website content to PDF?
« Reply #14 on: November 28, 2011, 06:51:35 PM »
If you used wget to pull the pages, you have the rendered HTML that is generated by the ASP, not the ASP code itself. It doesn't matter that the extension is .asp, it's still just HTML code. One of the html2pdf projects I linked to above should be able to render pdfs from the HTML, but the quality will vary... also, wget only pulls the page you request, not embedded images and things like that. It also doesn't execute javascript if the page depends on it during loading.