ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Solved PCI bus error

    IT Discussion
    dell poweredge poweredge 2850 lspci bmc ipmi
    10
    55
    3.7k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • DustinB3403D
      DustinB3403 @travisdh1
      last edited by

      @travisdh1 said in PCI bus error:

      @JaredBusch Most likely a failing NIC, yes.

      Also, talk about a blast from the past. That's the same era of hardware as a Vostro 220 I pulled out of service yesterday.

      Jared seems to be trying to keep this unit in service, unless of course there is a good reason to replace it.

      travisdh1T 1 Reply Last reply Reply Quote 0
      • travisdh1T
        travisdh1 @DustinB3403
        last edited by

        @DustinB3403 said in PCI bus error:

        @travisdh1 said in PCI bus error:

        @JaredBusch Most likely a failing NIC, yes.

        Also, talk about a blast from the past. That's the same era of hardware as a Vostro 220 I pulled out of service yesterday.

        Jared seems to be trying to keep this unit in service, unless of course there is a good reason to replace it.

        Yeah, this is @JaredBusch tho, I think we all know he's not keeping it in service by his choice.

        1 Reply Last reply Reply Quote 0
        • JaredBuschJ
          JaredBusch
          last edited by

          So, no one can hep me confirm that the NIC is failing?

          Also, anything I can look at to see if this is the onboard NIC or the NIC on a separate card?

          I'll be on site on Tuesday.

          ObsolesceO 1 Reply Last reply Reply Quote 0
          • ObsolesceO
            Obsolesce @JaredBusch
            last edited by

            @JaredBusch said in PCI bus error:

            So, no one can hep me confirm that the NIC is failing?

            Also, anything I can look at to see if this is the onboard NIC or the NIC on a separate card?

            I'll be on site on Tuesday.

            Did not see any info indicating NIC failure. No idea what device 6 is.

            Looking up the Intel NIC u linked shows this:

            Screenshot_20201213-113521_Edge.jpg

            1 Reply Last reply Reply Quote 0
            • 1
              1337
              last edited by 1337

              No, it's not the NIC.

              It says PCIe error bus 0, device 6, function 0.

              That's 00:06.0, and that is the PCI bridge E7520. Which I think is connected directly to the chipset on the CPU. Can't remember exactly what was on the CPU and the chipset back in those days.

              Either way the motherboard/CPU is done.

              Or I guess technically speaking a driver error caused by drive corruption could have caused the same error. After all it's the OS that gives the error message here.

              JaredBuschJ 2 Replies Last reply Reply Quote 3
              • JaredBuschJ
                JaredBusch @1337
                last edited by

                @Pete-S said in PCI bus error:

                After all it's the OS that gives the error message here.

                No the error message if from the BMC (predecessor to iDRAC).

                1 Reply Last reply Reply Quote 0
                • JaredBuschJ
                  JaredBusch @1337
                  last edited by

                  @Pete-S said in PCI bus error:

                  No, it's not the NIC.
                  It says PCIe error bus 0, device 6, function 0.

                  That is why I wanted others to look. The way I read the man page it seemed that the bus was omitted when using lspci.

                  1 1 Reply Last reply Reply Quote 0
                  • 1
                    1337 @JaredBusch
                    last edited by

                    @JaredBusch said in PCI bus error:

                    @Pete-S said in PCI bus error:

                    No, it's not the NIC.
                    

                    It says PCIe error bus 0, device 6, function 0.

                    That is why I wanted others to look. The way I read the man page it seemed that the bus was omitted when using lspci.

                    No its <bus>:<device>.<func>

                    But it's a bit confusing nowadays compared how it was in the old days when you had all the devices on the same bus.

                    1 Reply Last reply Reply Quote 0
                    • JaredBuschJ
                      JaredBusch
                      last edited by JaredBusch

                      So the customer asked me to spec out a replacement server.

                      This is what I am thinking to recommend.

                      Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                      AMD EPYC 7262 or 7302P 
                      1x 16gb RDIMM 3200MT
                      PERC H730P
                      3x 480GB SSD SATA Mix Use Hot plug
                      Dual hot plug power supply
                      Riser Config 1 1x16LP PCIe slot
                      iDRAC 9 Express
                      BOSS controller card with 2 M.2 240GB RAID 1
                      

                      Comments?

                      DashrenderD 1 JaredBuschJ 3 Replies Last reply Reply Quote 2
                      • scottalanmillerS
                        scottalanmiller
                        last edited by

                        Seems like anything will work in this scenario given how old the original was. What's the workload?

                        JaredBuschJ 2 Replies Last reply Reply Quote 1
                        • JaredBuschJ
                          JaredBusch @scottalanmiller
                          last edited by JaredBusch

                          @scottalanmiller said in PCI bus error:

                          Seems like anything will work in this scenario given how old the original was. What's the workload?

                          A proprietary system from TopTech

                          Server load is nothing normally. The system is catching up from a planned maintenance window at the moment.
                          a6ac8ef6-a47b-4f05-8623-f3abee52d37c-image.png
                          c911289b-d704-489b-a415-0c47e443006c-image.png

                          I'll get another snapshot once it is caught up.

                          1 Reply Last reply Reply Quote 0
                          • JaredBuschJ
                            JaredBusch @scottalanmiller
                            last edited by

                            @scottalanmiller said in PCI bus error:

                            Seems like anything will work in this scenario given how old the original was.

                            I am future planning. The system will get replaced by a new version.

                            But that requires infrastructure updates at the terminals also.

                            scottalanmillerS 1 Reply Last reply Reply Quote 0
                            • scottalanmillerS
                              scottalanmiller @JaredBusch
                              last edited by

                              @JaredBusch said in PCI bus error:

                              @scottalanmiller said in PCI bus error:

                              Seems like anything will work in this scenario given how old the original was.

                              I am future planning. The system will get replaced by a new version.

                              But that requires infrastructure updates at the terminals also.

                              Well sure, but even the smallest modern system will be orders of magnitude faster. Hard to believe anything wouldn't have the "oomph" for the task unless the workload isn't just updated, but overhauled.

                              JaredBuschJ 1 Reply Last reply Reply Quote 0
                              • JaredBuschJ
                                JaredBusch @scottalanmiller
                                last edited by

                                @scottalanmiller said in PCI bus error:

                                Well sure, but even the smallest modern system will be orders of magnitude faster. Hard to believe anything wouldn't have the "oomph" for the task unless the workload isn't just updated, but overhauled.

                                Right the workload will not change. That is pretty consistent. The specs for the new version are higher. But still, yes, anything modern will power it.

                                1 Reply Last reply Reply Quote 0
                                • JaredBuschJ
                                  JaredBusch
                                  last edited by

                                  Yeah it sleeps all day long..

                                  8799298e-ab9d-4111-89c2-31a82d3737cb-image.png

                                  1 Reply Last reply Reply Quote 0
                                  • DashrenderD
                                    Dashrender @JaredBusch
                                    last edited by

                                    @JaredBusch said in PCI bus error:

                                    So the customer asked me to spec out a replacement server.

                                    This is what I am thinking to recommend.

                                    Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                                    AMD EPYC 7262 or 7302P 
                                    1x 16gb RDIMM 3200MT
                                    PERC H730P
                                    3x 480GB SSD SATA Mix Use Hot plug
                                    Dual hot plug power supply
                                    Riser Config 1 1x16LP PCIe slot
                                    iDRAC 9 Express
                                    BOSS controller card with 2 M.2 240GB RAID 1
                                    

                                    Comments?

                                    Does iDRAC 9 Express allow remote access to the console?

                                    1 JaredBuschJ 2 Replies Last reply Reply Quote 0
                                    • 1
                                      1337 @JaredBusch
                                      last edited by 1337

                                      @JaredBusch said in PCI bus error:

                                      So the customer asked me to spec out a replacement server.

                                      This is what I am thinking to recommend.

                                      Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                                      AMD EPYC 7262 or 7302P 
                                      1x 16gb RDIMM 3200MT
                                      PERC H730P
                                      3x 480GB SSD SATA Mix Use Hot plug
                                      Dual hot plug power supply
                                      Riser Config 1 1x16LP PCIe slot
                                      iDRAC 9 Express
                                      BOSS controller card with 2 M.2 240GB RAID 1
                                      

                                      Comments?

                                      Yeah, I assume this is a low budget spec.

                                      Pick the cheapest epyc rome unless you expect the server to handle lots more in the future. 7232P is the cheapest.
                                      Also skip the BOSS card and pick 2x960GB read-intensive drives in RAID 1. Since you have the H730P RAID1 and it's cache, RAID1 should be more than fine.

                                      I mean comparing to the old machine you could also use the H330 card. You don't get the cache but the SSDs have cache and RAID1/10 doesn't require any parity calculations so the H330 will get the job done.

                                      JaredBuschJ 1 Reply Last reply Reply Quote 0
                                      • 1
                                        1337 @Dashrender
                                        last edited by

                                        @Dashrender said in PCI bus error:

                                        @JaredBusch said in PCI bus error:

                                        So the customer asked me to spec out a replacement server.

                                        This is what I am thinking to recommend.

                                        Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                                        AMD EPYC 7262 or 7302P 
                                        1x 16gb RDIMM 3200MT
                                        PERC H730P
                                        3x 480GB SSD SATA Mix Use Hot plug
                                        Dual hot plug power supply
                                        Riser Config 1 1x16LP PCIe slot
                                        iDRAC 9 Express
                                        BOSS controller card with 2 M.2 240GB RAID 1
                                        

                                        Comments?

                                        Does iDRAC 9 Express allow remote access to the console?

                                        I don't think so. You can only do power cycling with Express.

                                        1 Reply Last reply Reply Quote 0
                                        • JaredBuschJ
                                          JaredBusch @Dashrender
                                          last edited by

                                          @Dashrender said in PCI bus error:

                                          Does iDRAC 9 Express allow remote access to the console?

                                          Forgot to change that. I just copy/pasted form the dell build screen.

                                          1 Reply Last reply Reply Quote 0
                                          • JaredBuschJ
                                            JaredBusch @1337
                                            last edited by

                                            @Pete-S said in PCI bus error:

                                            Yeah, I assume this is a low budget spec.
                                            Pick the cheapest epyc rome unless you expect the server to handle lots more in the future. 7232P is the cheapest.

                                            Budget is not an issue. But it does not need anything bigger.

                                            The two specified, were the smallest two on Dell's website.

                                            1 1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 3
                                            • 1 / 3
                                            • First post
                                              Last post