
    Downloading full Website offline

    • scottalanmiller @dbeato

      @dbeato said in Downloading full Website offline:

      @Dashrender said in Downloading full Website offline:

      I wonder how much storage is needed?

      For example, ML took about 24 GB over two days of downloading. I stopped it because I didn't need it.

      Been a few days and still running here.

      • dbeato @Dashrender

        @Dashrender said in Downloading full Website offline:

        @dbeato said in Downloading full Website offline:

        @Dashrender said in Downloading full Website offline:

        I wonder how much storage is needed?

        For example, ML took about 24 GB over two days of downloading. I stopped it because I didn't need it.

        lol, not the site I was talking about 😛

        Spiceworks about 87 GB to 100 GB only on posts.

        • scottalanmiller @dbeato

          @dbeato said in Downloading full Website offline:

          @Dashrender said in Downloading full Website offline:

          @dbeato said in Downloading full Website offline:

          @Dashrender said in Downloading full Website offline:

          I wonder how much storage is needed?

          For example, ML took about 24 GB over two days of downloading. I stopped it because I didn't need it.

          lol, not the site I was talking about 😛

          Spiceworks about 87 GB to 100 GB only on posts.

          13.5GB on ML so far.

          • scottalanmiller

            ML has very little "media" on the site. So that doesn't expand very quickly.

            • black3dynamite

              Is there a way to do a dry run just to see how much storage will be consumed without actually downloading?

              • dbeato @black3dynamite

                @black3dynamite said in Downloading full Website offline:

                Is there a way to do a dry run just to see how much storage will be consumed without actually downloading?

                Not that I know of.

                • scottalanmiller @black3dynamite

                  @black3dynamite said in Downloading full Website offline:

                  Is there a way to do a dry run just to see how much storage will be consumed without actually downloading?

                  No, the only way to know the size is to grab every file and add it up. You could come up with a way to store that info without storing the files, but there is no way to know the total without downloading it all and adding it up. So there isn't really any value in a dry run; it would hit all the same things as the real deal.
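
A minimal sketch of the "store the info, not the files" idea, in Python. It assumes you already have a binary stream for each file (for example, the object returned by `urllib.request.urlopen`); the helper name `count_bytes` is made up for illustration. Note that it still reads every byte, so it saves disk space, not bandwidth.

```python
import io


def count_bytes(stream, chunk_size=64 * 1024):
    """Read a binary stream to the end and return its length, keeping nothing."""
    total = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:  # an empty bytes object means end of stream
            break
        total += len(chunk)  # record the size, discard the data
    return total


if __name__ == "__main__":
    # Stand-in for a downloaded file; a real run would pass an HTTP response.
    fake_file = io.BytesIO(b"x" * 150_000)
    print(count_bytes(fake_file))
```

Summing the return value over every URL on the site would give the total, at the cost of transferring everything once.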

                  • black3dynamite

                    @dbeato @scottalanmiller

                    https://github.com/mariomaric/website-size

                    • dbeato @black3dynamite

                      @black3dynamite said in Downloading full Website offline:

                      @dbeato @scottalanmiller

                      https://github.com/mariomaric/website-size

                      Pretty cool, that's why I said "not that I know of" 🙂

                      • travisdh1 @black3dynamite

                        @black3dynamite said in Downloading full Website offline:

                        @dbeato @scottalanmiller

                        https://github.com/mariomaric/website-size

                        That is literally saving the website to a temp directory. Why not just do it once instead of twice?

                        • scottalanmiller @black3dynamite

                          @black3dynamite said in Downloading full Website offline:

                          @dbeato @scottalanmiller

                          https://github.com/mariomaric/website-size

                          But does it really not download everything? Websites don't report on the size directly AFAIK.

                          • scottalanmiller @travisdh1

                            @travisdh1 said in Downloading full Website offline:

                            @black3dynamite said in Downloading full Website offline:

                            @dbeato @scottalanmiller

                            https://github.com/mariomaric/website-size

                            That is literally saving the website to a temp directory. Why not just do it once instead of twice?

                            Right, which is what I had originally said you could do. And doing so to a temp directory is STILL downloading the whole thing - the very thing you are trying to avoid.

                            • DustinB3403

                              @dbeato are you specifying a user account for this to run again?

                              • scottalanmiller @DustinB3403

                                @DustinB3403 said in Downloading full Website offline:

                                @dbeato are you specifying a user account for this to run again?

                                No, he'd have to pass a cookie to do that.
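
For reference, passing a session cookie looks roughly like this in Python. The URL and cookie value below are placeholders, not real values, and the cookie name a given forum expects is an assumption you would need to check in a logged-in browser. The helper name `build_request` is made up for illustration.

```python
import urllib.request


def build_request(url, cookie):
    """Return a Request that sends the given Cookie header with the fetch."""
    return urllib.request.Request(url, headers={"Cookie": cookie})


if __name__ == "__main__":
    # Placeholder URL and cookie value, copied by hand from a logged-in browser.
    req = build_request("https://forum.example/topic/1", "session=PLACEHOLDER")
    print(req.get_header("Cookie"))
    # urllib.request.urlopen(req) would then fetch the page as that session.
```

wget has a `--load-cookies` option for the same purpose, which reads the cookies from a file.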

                                • dbeato @DustinB3403

                                  @DustinB3403 said in Downloading full Website offline:

                                  @dbeato are you specifying a user account for this to run again?

                                  I am not.

                                  • DustinB3403 @scottalanmiller

                                    @scottalanmiller okay. . .

                                    So I know exactly why @dbeato is going through this process. The followup is then, how do you sort out the things you don't want from the website and retain those for later?

                                    • dbeato @DustinB3403

                                      @DustinB3403 said in Downloading full Website offline:

                                      @scottalanmiller okay. . .

                                      So I know exactly why @dbeato is going through this process. The followup is then, how do you sort out the things you don't want from the website and retain those for later?

                                      That is what I would do slowly in my spare time 🙂

                                      • JaredBusch @scottalanmiller

                                        @scottalanmiller said in Downloading full Website offline:

                                        @DustinB3403 said in Downloading full Website offline:

                                        @dbeato are you specifying a user account for this to run again?

                                        No, he'd have to pass a cookie to do that.

                                        48a5ebae-e0bc-45b5-81d5-a549493376fd-image.png

                                        • DustinB3403 @dbeato

                                          @dbeato said in Downloading full Website offline:

                                          @DustinB3403 said in Downloading full Website offline:

                                          @scottalanmiller okay. . .

                                          So I know exactly why @dbeato is going through this process. The followup is then, how do you sort out the things you don't want from the website and retain those for later?

                                          That is what I would do slowly in my spare time 🙂

                                          Ugh. . .

                                          • black3dynamite @scottalanmiller

                                            @scottalanmiller said in Downloading full Website offline:

                                            @black3dynamite said in Downloading full Website offline:

                                            @dbeato @scottalanmiller

                                            https://github.com/mariomaric/website-size

                                            But does it really not download everything? Websites don't report on the size directly AFAIK.

                                            Using --spider, wget does not download the pages, it just checks that they are there.
                                            Using --no-directories tells wget not to create a hierarchy of directories.
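
A rough sketch of how a spider run can still yield a size estimate: if wget is also run with `--server-response`, the log contains the response headers, and the `Content-Length` values can be added up. The log excerpt below is made up for illustration, and the function name `total_content_length` is hypothetical. This is not necessarily how the linked script works.

```python
import re

# Matches header lines such as "  Content-Length: 5120" in a wget log.
CONTENT_LENGTH = re.compile(
    r"^[ \t]*Content-Length:[ \t]*(\d+)[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def total_content_length(log_text):
    """Sum every Content-Length value found in the log text."""
    return sum(int(n) for n in CONTENT_LENGTH.findall(log_text))


if __name__ == "__main__":
    # Hypothetical log excerpt, not real wget output.
    sample_log = """\
  HTTP/1.1 200 OK
  Content-Type: text/html
  Content-Length: 5120
  HTTP/1.1 200 OK
  Content-Type: image/png
  Content-Length: 20480
"""
    print(total_content_length(sample_log))
```

The total is only an estimate: responses without a Content-Length header are not counted, and a page may be counted twice if wget requests it more than once.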
