ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Large or small Raid 5 with SSD

    Scheduled Pinned Locked Moved IT Discussion
    97 Posts 7 Posters 6.9k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • scottalanmillerS
      scottalanmiller @Donahue
      last edited by

      @Donahue said in Large or small Raid 5 with SSD:

      Ok, lets add a layer to this. Lets assume the raid 5 will lose a disk. Do I run with no spare of any kind, and when it fails, then buy a replacement and switch it out?

      You can, lots of places with four hour SLA hardware replacement plans do that. I wouldn't do that without a warranty to cover the replacements, though.

      1 Reply Last reply Reply Quote 0
      • DonahueD
        Donahue @scottalanmiller
        last edited by

        @scottalanmiller said in Large or small Raid 5 with SSD:

        @Donahue said in Large or small Raid 5 with SSD:

        Is the URE risk primarily during rebuild, or anytime it is in a degraded state?

        URE is quite nominal on SSDs typically. Not zero, but not like you are used to, either.

        but is the risk only present one I initiate a rebuild? As in, if a primary failure occurs, do I have time to assess my options before starting? I am basically trying to figure out if I should buy 4 or 5 drives. I know you said earlier that with raid 5, you may as well add that 5th drive to the array and make it a raid 6 as opposed to sitting on the shelf.

        scottalanmillerS JaredBuschJ 3 Replies Last reply Reply Quote 0
        • DonahueD
          Donahue
          last edited by

          I am probably looking at more like next day replacement

          1 Reply Last reply Reply Quote 1
          • DonahueD
            Donahue @scottalanmiller
            last edited by

            @scottalanmiller said in Large or small Raid 5 with SSD:

            @Donahue said in Large or small Raid 5 with SSD:

            also, am I right to assume that network contention can influence IOPS?

            Resulting IOPS to a third party service, but not IOPS themselves.

            It will certainly improve latency. That synology is averaging 14.6ms reads, with spikes over 280. writes are averaging 4.5ms with spikes over 200.

            1 Reply Last reply Reply Quote 0
            • scottalanmillerS
              scottalanmiller @Donahue
              last edited by

              @Donahue said in Large or small Raid 5 with SSD:

              @scottalanmiller said in Large or small Raid 5 with SSD:

              @Donahue said in Large or small Raid 5 with SSD:

              Is the URE risk primarily during rebuild, or anytime it is in a degraded state?

              URE is quite nominal on SSDs typically. Not zero, but not like you are used to, either.

              but is the risk only present one I initiate a rebuild? As in, if a primary failure occurs, do I have time to assess my options before starting? I am basically trying to figure out if I should buy 4 or 5 drives. I know you said earlier that with raid 5, you may as well add that 5th drive to the array and make it a raid 6 as opposed to sitting on the shelf.

              Yes, but if you are waiting, that's when you create the risk of a second drive failing. Because your time exposure goes from a few hours to potentially days. That's a lot of expansion.

              1 Reply Last reply Reply Quote 0
              • DonahueD
                Donahue
                last edited by

                just to clarify, we are talking about two different risks, with two different triggers, correct? The risk of a second disk failure while degraded, which is triggered the moment the first disk dies. The second risk (and less so for SSD) is URE, but my question is does this risk only trigger once you initiate a rebuild? Because it is the rebuild itself that is trying to read the unreadable block during its parity calculation?

                scottalanmillerS 1 Reply Last reply Reply Quote 0
                • JaredBuschJ
                  JaredBusch @Donahue
                  last edited by

                  @Donahue said in Large or small Raid 5 with SSD:

                  @scottalanmiller said in Large or small Raid 5 with SSD:

                  @Donahue said in Large or small Raid 5 with SSD:

                  Is the URE risk primarily during rebuild, or anytime it is in a degraded state?

                  URE is quite nominal on SSDs typically. Not zero, but not like you are used to, either.

                  but is the risk only present one I initiate a rebuild? As in, if a primary failure occurs, do I have time to assess my options before starting? I am basically trying to figure out if I should buy 4 or 5 drives. I know you said earlier that with raid 5, you may as well add that 5th drive to the array and make it a raid 6 as opposed to sitting on the shelf.

                  I never do hot spare. If you are going to have it plugged in, use it. Make it a RAID 6.

                  1 Reply Last reply Reply Quote 2
                  • scottalanmillerS
                    scottalanmiller @Donahue
                    last edited by

                    @Donahue said in Large or small Raid 5 with SSD:

                    just to clarify, we are talking about two different risks, with two different triggers, correct? The risk of a second disk failure while degraded, which is triggered the moment the first disk dies. The second risk (and less so for SSD) is URE, but my question is does this risk only trigger once you initiate a rebuild? Because it is the rebuild itself that is trying to read the unreadable block during its parity calculation?

                    The URE risk only triggers once you trigger a rebuild, but the shift risk happens the moment you delay replacing the disk. You can't win through that thought process.

                    DashrenderD 1 Reply Last reply Reply Quote 0
                    • scottalanmillerS
                      scottalanmiller @Donahue
                      last edited by

                      @Donahue said in Large or small Raid 5 with SSD:

                      I know you said earlier that with raid 5, you may as well add that 5th drive to the array and make it a raid 6 as opposed to sitting on the shelf.

                      Not "might as well", but "had better make sure you do." Difference in risk is astronomic. If you are even thinking hot spare is an option, we've not explain adequately how it works.

                      DashrenderD DonahueD 2 Replies Last reply Reply Quote 1
                      • DashrenderD
                        Dashrender @scottalanmiller
                        last edited by

                        @scottalanmiller said in Large or small Raid 5 with SSD:

                        @Donahue said in Large or small Raid 5 with SSD:

                        just to clarify, we are talking about two different risks, with two different triggers, correct? The risk of a second disk failure while degraded, which is triggered the moment the first disk dies. The second risk (and less so for SSD) is URE, but my question is does this risk only trigger once you initiate a rebuild? Because it is the rebuild itself that is trying to read the unreadable block during its parity calculation?

                        The URE risk only triggers once you trigger a rebuild, but the shift risk happens the moment you delay replacing the disk. You can't win through that thought process.

                        How is the URE not a risk the instant the first drive fails. Can't a URE happen during normal disk operation? i.e. you're in degraded status - and while reading before starting the rebuild, hit an URE?

                        travisdh1T scottalanmillerS 2 Replies Last reply Reply Quote 0
                        • DashrenderD
                          Dashrender @scottalanmiller
                          last edited by

                          @scottalanmiller said in Large or small Raid 5 with SSD:

                          @Donahue said in Large or small Raid 5 with SSD:

                          I know you said earlier that with raid 5, you may as well add that 5th drive to the array and make it a raid 6 as opposed to sitting on the shelf.

                          Not "might as well", but "had better make sure you do." Difference in risk is astronomic. If you are even thinking hot spare is an option, we've not explain adequately how it works.

                          Assuming the performance hit is as low as Scott claims (and I'm sure he's right) then there would be no reason to not put the protection in place now - you have the drive, just use it. Sitting it on the shelf introduces risk you don't need to take - the amount of time for you to be notified, and then act upon that notification before a second drive fails. Why expose that risk when you don't have to.

                          1 Reply Last reply Reply Quote 1
                          • travisdh1T
                            travisdh1 @Dashrender
                            last edited by

                            @Dashrender said in Large or small Raid 5 with SSD:

                            @scottalanmiller said in Large or small Raid 5 with SSD:

                            @Donahue said in Large or small Raid 5 with SSD:

                            just to clarify, we are talking about two different risks, with two different triggers, correct? The risk of a second disk failure while degraded, which is triggered the moment the first disk dies. The second risk (and less so for SSD) is URE, but my question is does this risk only trigger once you initiate a rebuild? Because it is the rebuild itself that is trying to read the unreadable block during its parity calculation?

                            The URE risk only triggers once you trigger a rebuild, but the shift risk happens the moment you delay replacing the disk. You can't win through that thought process.

                            How is the URE not a risk the instant the first drive fails.

                            It is a risk, but because we're talking SSD specifically, the chance of a URE failure is exponentially smaller than a HDD. The flash will normally fail before a URE. Not impossible, just the chance of it actually happening is much smaller than other failures happening.

                            Can't a URE happen during normal disk operation? i.e. you're in degraded status - and while reading before starting the rebuild, hit an URE?

                            Normal operation of the RAID would correct the issue. Degraded status depends on the type of RAID IE: RAID6 degraded mode should function as a RAID5, so a URE doesn't become a problem until the 2nd drive fails.

                            Again, URE is not an expected failure point for SSD drives. Not that it can't happen, it's just very unlikely.

                            scottalanmillerS 1 Reply Last reply Reply Quote 0
                            • scottalanmillerS
                              scottalanmiller @Dashrender
                              last edited by

                              @Dashrender said in Large or small Raid 5 with SSD:

                              @scottalanmiller said in Large or small Raid 5 with SSD:

                              @Donahue said in Large or small Raid 5 with SSD:

                              just to clarify, we are talking about two different risks, with two different triggers, correct? The risk of a second disk failure while degraded, which is triggered the moment the first disk dies. The second risk (and less so for SSD) is URE, but my question is does this risk only trigger once you initiate a rebuild? Because it is the rebuild itself that is trying to read the unreadable block during its parity calculation?

                              The URE risk only triggers once you trigger a rebuild, but the shift risk happens the moment you delay replacing the disk. You can't win through that thought process.

                              How is the URE not a risk the instant the first drive fails. Can't a URE happen during normal disk operation? i.e. you're in degraded status - and while reading before starting the rebuild, hit an URE?

                              It's not an array risk outside of a rebuild. It's rebuilding that causes the cascade of the URE to make the whole array unreadable.

                              That a URE can happen is not the fear. That a URE can happen during a rebuild is the fear. Because the array is a "single file" being rewritten during a rebuild and the write operation cannot complete. Leaving everything lost.

                              1 Reply Last reply Reply Quote 2
                              • scottalanmillerS
                                scottalanmiller @travisdh1
                                last edited by

                                @travisdh1 said in Large or small Raid 5 with SSD:

                                Normal operation of the RAID would correct the issue. Degraded status depends on the type of RAID IE: RAID6 degraded mode should function as a RAID5, so a URE doesn't become a problem until the 2nd drive fails.

                                To be clear, a URE during normal degraded operations does impact one file, but not the array. From the point of view of the array, nothing is wrong. During a rebuild, that same URE takes out the entire array in a parity RAID system. So very different results from the same URE.

                                1 DashrenderD 2 Replies Last reply Reply Quote 2
                                • 1
                                  1337 @scottalanmiller
                                  last edited by 1337

                                  @scottalanmiller said in Large or small Raid 5 with SSD:

                                  @travisdh1 said in Large or small Raid 5 with SSD:

                                  Normal operation of the RAID would correct the issue. Degraded status depends on the type of RAID IE: RAID6 degraded mode should function as a RAID5, so a URE doesn't become a problem until the 2nd drive fails.

                                  To be clear, a URE during normal degraded operations does impact one file, but not the array. From the point of view of the array, nothing is wrong. During a rebuild, that same URE takes out the entire array in a parity RAID system. So very different results from the same URE.

                                  Yes, but shouldn't every raid array be setup to do data scrubbing? If you setup mdadm you for sure get data scrubbing by default.

                                  If you scrub the data on a regular basis it's unlikely that an URE should show up just when one SSD has failed. And as in all cases backup is the final solution.

                                  1 scottalanmillerS 2 Replies Last reply Reply Quote 0
                                  • DonahueD
                                    Donahue @scottalanmiller
                                    last edited by

                                    @scottalanmiller said in Large or small Raid 5 with SSD:

                                    @Donahue said in Large or small Raid 5 with SSD:

                                    I know you said earlier that with raid 5, you may as well add that 5th drive to the array and make it a raid 6 as opposed to sitting on the shelf.

                                    Not "might as well", but "had better make sure you do." Difference in risk is astronomic. If you are even thinking hot spare is an option, we've not explain adequately how it works.

                                    I was thinking cold spare, not hot spare. I don't want the array rebuilding automatically before I have time to make a conscience decision to do it. But the different is similar, I still would have a spare and is not helping the array at all just sitting on the shelf.

                                    scottalanmillerS 1 Reply Last reply Reply Quote 0
                                    • 1
                                      1337 @1337
                                      last edited by

                                      Also read operations have zero impact on SSDs contrary to spinning harddrives.

                                      1 Reply Last reply Reply Quote 1
                                      • DonahueD
                                        Donahue
                                        last edited by

                                        I know the analogy is not perfect, but in my head I am thinking of the spare disk as a spare tire on a car. having a cold spare on the shelf to me is like having the spare tire mounted to the back or underneath the car, not being actively used to help the car stay on the road. So my instinct is to make sure I've got a spare. In the case of a 4 drive raid 5, that means a 5th disk. But as you say, IF I have that disk anyways, it is better, and as you say, emphatically so, to actually use that disk in the array from the beginning and have a 5 disk raid 6 and no spare. But that leads me back to my original position of not having a spare which my animal brain intuitively thinks of as bad and that I should get a spare. I know that my assumptions and instincts are wrong here, because I do not fully understand the scope of the difference in risks between the 4 drive raid 5 and the 5 drive raid 6. That is why I am asking all these questions, so that I can more fully understand my options and evaluate my choices based on empirical data or good logic, and not on instinct or intuition.

                                        DashrenderD 1 Reply Last reply Reply Quote 0
                                        • DonahueD
                                          Donahue
                                          last edited by

                                          I am still thinking of the problem as being one of linear risk and safety, not logarithmic, and that is my fundamental flaw I think.

                                          scottalanmillerS 1 Reply Last reply Reply Quote 0
                                          • DashrenderD
                                            Dashrender @scottalanmiller
                                            last edited by

                                            @scottalanmiller said in Large or small Raid 5 with SSD:

                                            @travisdh1 said in Large or small Raid 5 with SSD:

                                            Normal operation of the RAID would correct the issue. Degraded status depends on the type of RAID IE: RAID6 degraded mode should function as a RAID5, so a URE doesn't become a problem until the 2nd drive fails.

                                            To be clear, a URE during normal degraded operations does impact one file, but not the array. From the point of view of the array, nothing is wrong. During a rebuild, that same URE takes out the entire array in a parity RAID system. So very different results from the same URE.

                                            AWWWW - this is what I was missing. OK a normal read operation will only break one file. Thanks. that explains a lot!

                                            scottalanmillerS 1 Reply Last reply Reply Quote 1
                                            • 1
                                            • 2
                                            • 3
                                            • 4
                                            • 5
                                            • 3 / 5
                                            • First post
                                              Last post