    Solved PCI bus error

    IT Discussion
    dell poweredge poweredge 2850 lspci bmc ipmi
      Obsolesce @JaredBusch

      @JaredBusch said in PCI bus error:

      So, no one can help me confirm that the NIC is failing?

      Also, anything I can look at to see if this is the onboard NIC or the NIC on a separate card?

      I'll be on site on Tuesday.

      Did not see any info indicating NIC failure. No idea what device 6 is.

      Looking up the Intel NIC you linked shows this:

      Screenshot_20201213-113521_Edge.jpg

        1337

        No, it's not the NIC.

        It says PCIe error bus 0, device 6, function 0.

        That's 00:06.0, and that is the E7520 PCI bridge, which I think is connected directly to the chipset on the CPU. I can't remember exactly what was on the CPU and what was on the chipset back in those days.
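
For anyone mapping a BMC message like "bus 0, device 6, function 0" onto lspci output, here is a minimal Python sketch of the conversion. The function name `format_bdf` is made up for illustration. It relies only on lspci printing addresses as hexadecimal bus:device.function, with an 8-bit bus, a 5-bit device, and a 3-bit function number, and it assumes the BMC reports the same numbers the OS sees.

```python
from typing import Optional


def format_bdf(bus: int, device: int, function: int,
               domain: Optional[int] = None) -> str:
    """Format a PCI address the way lspci prints it (hex bus:device.function).

    format_bdf is a hypothetical helper name, not part of any tool.
    """
    # PCI addressing limits: 8-bit bus, 5-bit device, 3-bit function.
    if not (0 <= bus <= 0xFF and 0 <= device <= 0x1F and 0 <= function <= 0x7):
        raise ValueError("bus must be 0-255, device 0-31, function 0-7")
    address = f"{bus:02x}:{device:02x}.{function:x}"
    # lspci only shows the domain prefix when asked to (or when needed).
    if domain is not None:
        address = f"{domain:04x}:{address}"
    return address


# The values from the error message in this thread.
print(format_bdf(0, 6, 0))  # prints 00:06.0
```

The result can then be compared against the first column of the lspci listing to see which device the message points at.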

        Either way the motherboard/CPU is done.

        Or I guess technically speaking a driver error caused by drive corruption could have caused the same error. After all it's the OS that gives the error message here.

          JaredBusch @1337

          @Pete-S said in PCI bus error:

          After all it's the OS that gives the error message here.

          No, the error message is from the BMC (predecessor to iDRAC).

            JaredBusch @1337

            @Pete-S said in PCI bus error:

            No, it's not the NIC.
            It says PCIe error bus 0, device 6, function 0.

            That is why I wanted others to look. The way I read the man page it seemed that the bus was omitted when using lspci.

              1337 @JaredBusch

              @JaredBusch said in PCI bus error:

              @Pete-S said in PCI bus error:

              No, it's not the NIC.
              

              It says PCIe error bus 0, device 6, function 0.

              That is why I wanted others to look. The way I read the man page it seemed that the bus was omitted when using lspci.

              No, it's <bus>:<device>.<func>

              But it's a bit confusing nowadays compared to how it was in the old days, when you had all the devices on the same bus.
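
Going the other way, a rough Python sketch for pulling the bus, device, and function out of the start of an lspci line could look like this. The name `parse_bdf` and the sample lines are hypothetical (they are not output from the server in this thread). The optional four-digit prefix is the PCI domain, which lspci prints when run with `-D`.

```python
import re

# Optional 4-hex-digit domain, then hex bus:device.function at line start.
BDF_RE = re.compile(
    r"^(?:(?P<domain>[0-9a-fA-F]{4}):)?"
    r"(?P<bus>[0-9a-fA-F]{2}):"
    r"(?P<device>[0-9a-fA-F]{2})\."
    r"(?P<function>[0-7])\b"
)


def parse_bdf(line):
    """Return (domain, bus, device, function) as integers.

    parse_bdf is a hypothetical helper; a missing domain is treated as 0.
    """
    match = BDF_RE.match(line.strip())
    if match is None:
        raise ValueError(f"no PCI address at start of line: {line!r}")
    domain = int(match.group("domain") or "0", 16)
    bus = int(match.group("bus"), 16)
    device = int(match.group("device"), 16)
    function = int(match.group("function"), 16)
    return domain, bus, device, function


# Illustrative input only; the description text is made up.
print(parse_bdf("00:06.0 PCI bridge: example description"))
```

The key point is that every field is hexadecimal, so a device shown as `0e` is device 14 in a decimal error message.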

                JaredBusch

                So the customer asked me to spec out a replacement server.

                This is what I am thinking to recommend.

                Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                AMD EPYC 7262 or 7302P 
                1x 16GB RDIMM 3200MT/s
                PERC H730P
                3x 480GB SSD SATA Mix Use Hot plug
                Dual hot plug power supply
                Riser Config 1 1x16LP PCIe slot
                iDRAC 9 Express
                BOSS controller card with 2 M.2 240GB RAID 1
                

                Comments?

                  scottalanmiller

                  Seems like anything will work in this scenario given how old the original was. What's the workload?

                    JaredBusch @scottalanmiller

                    @scottalanmiller said in PCI bus error:

                    Seems like anything will work in this scenario given how old the original was. What's the workload?

                    A proprietary system from TopTech

                    Server load is nothing normally. The system is catching up from a planned maintenance window at the moment.
                    a6ac8ef6-a47b-4f05-8623-f3abee52d37c-image.png
                    c911289b-d704-489b-a415-0c47e443006c-image.png

                    I'll get another snapshot once it is caught up.

                      JaredBusch @scottalanmiller

                      @scottalanmiller said in PCI bus error:

                      Seems like anything will work in this scenario given how old the original was.

                      I am future planning. The system will get replaced by a new version.

                      But that requires infrastructure updates at the terminals also.

                        scottalanmiller @JaredBusch

                        @JaredBusch said in PCI bus error:

                        @scottalanmiller said in PCI bus error:

                        Seems like anything will work in this scenario given how old the original was.

                        I am future planning. The system will get replaced by a new version.

                        But that requires infrastructure updates at the terminals also.

                        Well sure, but even the smallest modern system will be orders of magnitude faster. Hard to believe anything wouldn't have the "oomph" for the task unless the workload isn't just updated, but overhauled.

                          JaredBusch @scottalanmiller

                          @scottalanmiller said in PCI bus error:

                          Well sure, but even the smallest modern system will be orders of magnitude faster. Hard to believe anything wouldn't have the "oomph" for the task unless the workload isn't just updated, but overhauled.

                          Right, the workload will not change. That is pretty consistent. The specs for the new version are higher. But still, yes, anything modern will power it.

                            JaredBusch

                            Yeah it sleeps all day long..

                            8799298e-ab9d-4111-89c2-31a82d3737cb-image.png

                              Dashrender @JaredBusch

                              @JaredBusch said in PCI bus error:

                              So the customer asked me to spec out a replacement server.

                              This is what I am thinking to recommend.

                              Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                              AMD EPYC 7262 or 7302P 
                              1x 16GB RDIMM 3200MT/s
                              PERC H730P
                              3x 480GB SSD SATA Mix Use Hot plug
                              Dual hot plug power supply
                              Riser Config 1 1x16LP PCIe slot
                              iDRAC 9 Express
                              BOSS controller card with 2 M.2 240GB RAID 1
                              

                              Comments?

                              Does iDRAC 9 Express allow remote access to the console?

                                1337 @JaredBusch

                                @JaredBusch said in PCI bus error:

                                So the customer asked me to spec out a replacement server.

                                This is what I am thinking to recommend.

                                Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                                AMD EPYC 7262 or 7302P 
                                1x 16GB RDIMM 3200MT/s
                                PERC H730P
                                3x 480GB SSD SATA Mix Use Hot plug
                                Dual hot plug power supply
                                Riser Config 1 1x16LP PCIe slot
                                iDRAC 9 Express
                                BOSS controller card with 2 M.2 240GB RAID 1
                                

                                Comments?

                                Yeah, I assume this is a low budget spec.

                                Pick the cheapest EPYC Rome unless you expect the server to handle lots more in the future. 7232P is the cheapest.
                                Also skip the BOSS card and pick 2x 960GB read-intensive drives in RAID 1. With the H730P and its cache, RAID 1 should be more than fine.

                                I mean, compared to the old machine, you could also use the H330 card. You don't get the cache, but the SSDs have cache and RAID 1/10 doesn't require any parity calculations, so the H330 will get the job done.
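
To illustrate the parity point, here is a small Python sketch of what a parity RAID level has to compute on every write, compared with a mirror. It is only a toy model: real controllers do this in firmware or hardware, the helper name `xor_parity` is made up, and the thread never states which RAID level the 3x 480GB layout was meant to use.

```python
def xor_parity(blocks):
    """Byte-wise XOR of equally sized blocks (the RAID 5 style parity idea).

    xor_parity is a hypothetical helper name used only for this sketch.
    """
    parity = bytearray(len(blocks[0]))
    for block in blocks:
        for index, value in enumerate(block):
            parity[index] ^= value
    return bytes(parity)


# Parity layout: two data blocks plus one computed parity block.
data0 = b"\x01\x02\x03\x04"
data1 = b"\x10\x20\x30\x40"
parity = xor_parity([data0, data1])

# If the drive holding data1 fails, XOR the survivors to rebuild it.
rebuilt_data1 = xor_parity([data0, parity])
print(rebuilt_data1 == data1)  # prints True

# Mirror layout (RAID 1): the second drive holds a plain copy.
# Nothing is computed on write and nothing is reconstructed on failure.
mirror_a = data0 + data1
mirror_b = bytes(mirror_a)
print(mirror_a == mirror_b)  # prints True
```

With a mirror the controller only duplicates writes, which is why a cacheless card such as the H330 is a reasonable fit for RAID 1 or RAID 10.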

                                  1337 @Dashrender

                                  @Dashrender said in PCI bus error:

                                  @JaredBusch said in PCI bus error:

                                  So the customer asked me to spec out a replacement server.

                                  This is what I am thinking to recommend.

                                  Dell PowerEdge R6515 – Chassis with 8x 2.5” drives
                                  AMD EPYC 7262 or 7302P 
                                  1x 16GB RDIMM 3200MT/s
                                  PERC H730P
                                  3x 480GB SSD SATA Mix Use Hot plug
                                  Dual hot plug power supply
                                  Riser Config 1 1x16LP PCIe slot
                                  iDRAC 9 Express
                                  BOSS controller card with 2 M.2 240GB RAID 1
                                  

                                  Comments?

                                  Does iDRAC 9 Express allow remote access to the console?

                                  I don't think so. You can only do power cycling with Express.

                                    JaredBusch @Dashrender

                                    @Dashrender said in PCI bus error:

                                    Does iDRAC 9 Express allow remote access to the console?

                                    Forgot to change that. I just copy/pasted from the Dell build screen.

                                      JaredBusch @1337

                                      @Pete-S said in PCI bus error:

                                      Yeah, I assume this is a low budget spec.
                                      Pick the cheapest EPYC Rome unless you expect the server to handle lots more in the future. 7232P is the cheapest.

                                      Budget is not an issue. But it does not need anything bigger.

                                      The two specified were the smallest two on Dell's website.

                                        JaredBusch

                                        So I restored the backup from 2012 onto a Proxmox VM on my laptop.

                                        297ba20f-fb70-4071-b5ec-e2065707ed41-image.png

                                        Damnit...

                                        I know this backup set is good, as I made it in 2012, at the time on VMware 5. I also restored it to physical in 2015.

                                          DustinB3403 @JaredBusch

                                          @JaredBusch do you routinely expect 8-year-old backups to operate on today's hypervisors?

                                            DustinB3403

                                            Pages 3 and 4 of this PDF list what iDRAC offers for the different versions. Virtual Console is not included with iDRAC Express.
