ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Dell PERC Question (Server Down)

    IT Discussion
    17
    255
    147.6k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • DashrenderD
      Dashrender
      last edited by

      How does my fan issue get caused by the iLo?

      Would it be something like, the iLo software has a bug, when it tries to read the fans it uses the wrong API calls, which the fans read as an error, and when the fans read an error this spin up?

      scottalanmillerS 1 Reply Last reply Reply Quote 0
      • BRRABillB
        BRRABill
        last edited by

        And I'm not blaming the iDRAC.

        Like I said, I just figured it would a reboot when it got licensed, but it did not. (That's actually the way things should ALWAYS work!)

        And while I said it did not happen in the months prior to enabling the licensing on the iDRAC, that was also the weekend I installed my production mail server on a XS VM on this. SO, there is likely a lot more array activity.

        I have reached out to EDGE and xByte, and will try to work with them further on the issue and report back. They have not been on ML in a while so hopefully they will chime in. (Actually, I will send an e-mail to let them know.)

        L todd-at-xByteT 2 Replies Last reply Reply Quote 0
        • BRRABillB
          BRRABill @scottalanmiller
          last edited by

          @scottalanmiller sai

          If code issues make it past the demarcation point, that's an error on the server side, not the iDRAC side. Even if it is triggered by bad code in the iDRAC, the iDRAC only gets to do as much damage as the server lets it do.

          But the iDARC has full access to the server.

          I understand if I did sometthing stupid, the server would let me, but I just don't see how using the iDRAC I should be expecting that kind of behavior.

          Or that if the iDRAC told the server to do something bad we should be blaming anyone else than the iDRAC.

          scottalanmillerS 1 Reply Last reply Reply Quote 0
          • scottalanmillerS
            scottalanmiller @Dashrender
            last edited by

            @Dashrender said in Dell PERC Question (Server Down):

            How does my fan issue get caused by the iLo?

            Would it be something like, the iLo software has a bug, when it tries to read the fans it uses the wrong API calls, which the fans read as an error, and when the fans read an error this spin up?

            ILO reads the temperature sensors, it might pass the sensors out and then the speed control back in.

            1 Reply Last reply Reply Quote 0
            • scottalanmillerS
              scottalanmiller @BRRABill
              last edited by

              @BRRABill said in Dell PERC Question (Server Down):

              @scottalanmiller sai

              If code issues make it past the demarcation point, that's an error on the server side, not the iDRAC side. Even if it is triggered by bad code in the iDRAC, the iDRAC only gets to do as much damage as the server lets it do.

              But the iDARC has full access to the server.

              I understand if I did sometthing stupid, the server would let me, but I just don't see how using the iDRAC I should be expecting that kind of behavior.

              Or that if the iDRAC told the server to do something bad we should be blaming anyone else than the iDRAC.

              If the code in the iDRAC actually issues a call like "drop a drive", then the iDRAC is being a bad actor, just like you could do from the PERC console. If the issue is that the iDRAC is issueing gibbering and the PERC decides to drop the drive because of gibberish, that's the PERC's fault for doing something it wasn't told to do.

              1 Reply Last reply Reply Quote 0
              • L
                Lyndsie_xByte Vendor @BRRABill
                last edited by

                @BRRABill Thank you for reaching out! Even though I have set up an email notification for this post, I haven't been receiving them. Much appreciate you keeping me in the loop. This is beyond my basic IT knowledge (marketing gal here), but I will alert one of our engineers to see if they can chime in.

                DashrenderD 1 Reply Last reply Reply Quote 1
                • DashrenderD
                  Dashrender @Lyndsie_xByte
                  last edited by

                  @Lyndsie_xByte said in Dell PERC Question (Server Down):

                  @BRRABill Thank you for reaching out! Even though I have set up an email notification for this post, I haven't been receiving them. Much appreciate you keeping me in the loop. This is beyond my basic IT knowledge (marketing gal here), but I will alert one of our engineers to see if they can chime in.

                  Email notices run out after about 5 days or less. I think ML is working on it, but it's a cost issue.

                  JaredBuschJ 1 Reply Last reply Reply Quote 1
                  • todd-at-xByteT
                    todd-at-xByte @BRRABill
                    last edited by

                    @BRRABill
                    I'm coming late into this thread and I'm having problems discerning exactly what the issue is right now. Please contact your xByte rep Brad and he will get a support request going. Our techs can assist directly and can get Edge officially involved instead of trying to rely on ML posts.
                    --Todd

                    BRRABillB scottalanmillerS StrongBadS 3 Replies Last reply Reply Quote 1
                    • JaredBuschJ
                      JaredBusch @Dashrender
                      last edited by

                      @Dashrender said in Dell PERC Question (Server Down):

                      @Lyndsie_xByte said in Dell PERC Question (Server Down):

                      @BRRABill Thank you for reaching out! Even though I have set up an email notification for this post, I haven't been receiving them. Much appreciate you keeping me in the loop. This is beyond my basic IT knowledge (marketing gal here), but I will alert one of our engineers to see if they can chime in.

                      Email notices run out after about 5 days or less. I think ML is working on it, but it's a cost issue.

                      Not exactly. It is a choice to use a service with limits over sending directly and updating records.

                      scottalanmillerS 1 Reply Last reply Reply Quote 0
                      • BRRABillB
                        BRRABill @todd-at-xByte
                        last edited by

                        @todd-at-xByte said

                        @BRRABill
                        I'm coming late into this thread and I'm having problems discerning exactly what the issue is right now. Please contact your xByte rep Brad and he will get a support request going. Our techs can assist directly and can get Edge officially involved instead of trying to rely on ML posts.
                        --Todd

                        Todd:

                        I reached out to Brad yesterday to open a case with your tech support. Though we already kind of went through them and they sent us to EDGE. I've been having problems with EDGE responding to me, which is why I reached back out to Lyndsey who set that up the first time.

                        1 Reply Last reply Reply Quote 2
                        • scottalanmillerS
                          scottalanmiller @JaredBusch
                          last edited by

                          @JaredBusch said in Dell PERC Question (Server Down):

                          @Dashrender said in Dell PERC Question (Server Down):

                          @Lyndsie_xByte said in Dell PERC Question (Server Down):

                          @BRRABill Thank you for reaching out! Even though I have set up an email notification for this post, I haven't been receiving them. Much appreciate you keeping me in the loop. This is beyond my basic IT knowledge (marketing gal here), but I will alert one of our engineers to see if they can chime in.

                          Email notices run out after about 5 days or less. I think ML is working on it, but it's a cost issue.

                          Not exactly. It is a choice to use a service with limits over sending directly and updating records.

                          We tried sending directly and were blacklisted. We could try to get that to work but have tried this in the past and not had luck. We couldn't get even test messages to go out locally. If we switched to local, email would just stop for nearly everyone, all the time, completely.

                          1 Reply Last reply Reply Quote 0
                          • scottalanmillerS
                            scottalanmiller @todd-at-xByte
                            last edited by

                            @todd-at-xByte said in Dell PERC Question (Server Down):

                            @BRRABill
                            I'm coming late into this thread and I'm having problems discerning exactly what the issue is right now. Please contact your xByte rep Brad and he will get a support request going. Our techs can assist directly and can get Edge officially involved instead of trying to rely on ML posts.
                            --Todd

                            Why not get Edge responding here?

                            1 Reply Last reply Reply Quote 1
                            • StrongBadS
                              StrongBad @todd-at-xByte
                              last edited by

                              @todd-at-xByte said in Dell PERC Question (Server Down):

                              I'm coming late into this thread and I'm having problems discerning exactly what the issue is right now.

                              From what I could tell, the issue is that Edge does not respond.

                              BRRABillB 1 Reply Last reply Reply Quote 1
                              • BRRABillB
                                BRRABill @StrongBad
                                last edited by

                                @StrongBad said

                                From what I could tell, the issue is that Edge does not respond.

                                Yes, the tech who was working with me has not responded.

                                Now, in the past few weeks I have dealt with people on vacation, and people who were sick, and everything else. So I always like to give them the benefit of the doubt as to why they are not responding. šŸ™‚

                                DustinB3403D 1 Reply Last reply Reply Quote 1
                                • DustinB3403D
                                  DustinB3403 @BRRABill
                                  last edited by

                                  @BRRABill Any update to share?

                                  1 Reply Last reply Reply Quote 0
                                  • BRRABillB
                                    BRRABill
                                    last edited by

                                    This was the latest e-mail from earlier this afternoon:

                                    "That information is good. I was hoping that your iDRAC log would shine some light on what the actual fault error was being recorded when the drive array is actually going down. I’m working on this now with one of our SSD engineers and I am hoping to have some additional information or potential resolutions about this issue today. "

                                    1 Reply Last reply Reply Quote 2
                                    • StrongBadS
                                      StrongBad
                                      last edited by

                                      Checking in again.

                                      1 Reply Last reply Reply Quote 1
                                      • BRRABillB
                                        BRRABill
                                        last edited by

                                        Have not heard from them today, sadly.

                                        Let me go rattle the cage...

                                        1 Reply Last reply Reply Quote 1
                                        • BRRABillB
                                          BRRABill
                                          last edited by BRRABill

                                          The cage rattling did nothing.

                                          In other news, the RAID array crashed again this morning. Management is starting to ask questions, so I think I am just going to go back to the old DELL spinning rust drives. I don't think I have an option at this point.

                                          This time (this is the fourth time this has happen in a month) was similar to times 1 and 2. In both those instances the entire virtual disk disappeared, as did the physical disks. If you boot into the PERC config, you will see under the FOREIGN tab that the VD and the PD are both there. "Simply" reimport the config, and you're all set.

                                          The third time, the array was still there, it was the disk in 0:0 that was missing. So we cleared the foreign config off of that.

                                          This fourth time, I took more notice of what happened when the array came back up. Sure enough it was 0;0 that was degraded. But, I don't know if I can trust that it might just be that drive.

                                          Here are some pictures of the PERC screens...

                                          0_1461325759560_fri error.png

                                          0_1461325766816_fri error 2.png

                                          0_1461325773018_fri error 3.png

                                          1 Reply Last reply Reply Quote 0
                                          • scottalanmillerS
                                            scottalanmiller
                                            last edited by

                                            It's possible that the PERC is bad, I suppose.

                                            BRRABillB 2 Replies Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 5
                                            • 6
                                            • 7
                                            • 8
                                            • 9
                                            • 12
                                            • 13
                                            • 7 / 13
                                            • First post
                                              Last post