ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    Overheating NICs in SuperMicro on FreeBSD

    IT Discussion
    freenas network supermicro freebsd
    7
    21
    3.5k
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • DashrenderD
      Dashrender
      last edited by

      Wow - nice find.

      Warranty?

      scottalanmillerS 1 Reply Last reply Reply Quote 0
      • scottalanmillerS
        scottalanmiller @Dashrender
        last edited by

        @Dashrender said in Overheating NICs in SuperMicro on FreeBSD:

        Wow - nice find.

        Warranty?

        Very old gear, I would guess not.

        1 Reply Last reply Reply Quote 0
        • DashrenderD
          Dashrender
          last edited by

          yeah after I posted that I also was wondering, perhaps not a card issue, but a system heat dissipation problem, like bad fans, etc.

          scottalanmillerS 1 Reply Last reply Reply Quote 1
          • momurdaM
            momurda
            last edited by

            Dang that is crazy. Could possibly find the chips on the mobo or card and put a heat spreader on it, like the ones they use for video card ram.

            1 Reply Last reply Reply Quote 1
            • scottalanmillerS
              scottalanmiller @Dashrender
              last edited by

              @Dashrender said in Overheating NICs in SuperMicro on FreeBSD:

              yeah after I posted that I also was wondering, perhaps not a card issue, but a system heat dissipation problem, like bad fans, etc.

              That's what we are guessing. It's in a clean, dedicated server room that is an even 72 degrees. No HVAC failures. Nothing blocking the air flow.

              1 Reply Last reply Reply Quote 0
              • scottalanmillerS
                scottalanmiller
                last edited by

                OMG, they got on the IPMI and internal sensors peg the system at over 115C!! Holy crap. Never seen a server get that hot and come back from it. Amazing that those disks still spin. They've got the vendor on the phone right now getting them to look into it.

                gjacobseG 1 Reply Last reply Reply Quote 0
                • DashrenderD
                  Dashrender
                  last edited by

                  Holy cats!

                  1 Reply Last reply Reply Quote 0
                  • gjacobseG
                    gjacobse @scottalanmiller
                    last edited by

                    @scottalanmiller said in Overheating NICs in SuperMicro on FreeBSD:

                    OMG, they got on the IPMI and internal sensors peg the system at over 115C!! Holy crap. Never seen a server get that hot and come back from it. Amazing that those disks still spin. They've got the vendor on the phone right now getting them to look into it.

                    They are pushing the limit on the smoke valve... jeeze.

                    1 Reply Last reply Reply Quote 0
                    • thwrT
                      thwr
                      last edited by

                      Onboard NICs? Strange

                      scottalanmillerS 1 Reply Last reply Reply Quote 0
                      • momurdaM
                        momurda
                        last edited by

                        It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                        thwrT scottalanmillerS 2 Replies Last reply Reply Quote 3
                        • thwrT
                          thwr @momurda
                          last edited by

                          @momurda said in Overheating NICs in SuperMicro on FreeBSD:

                          It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                          Most ICs should survive this. AMD had a range of GPUs reaching 95° - during regular use with no OC involved. But 115°C is a lot.

                          Broken sensors / reporting could be a reason, good point. @scottalanmiller: Got an infrared camera?

                          scottalanmillerS 1 Reply Last reply Reply Quote 1
                          • scottalanmillerS
                            scottalanmiller @thwr
                            last edited by

                            @thwr said in Overheating NICs in SuperMicro on FreeBSD:

                            Onboard NICs? Strange

                            Yup, on board 10GigE. Getting more and more common these days.

                            1 Reply Last reply Reply Quote 0
                            • scottalanmillerS
                              scottalanmiller @momurda
                              last edited by

                              @momurda said in Overheating NICs in SuperMicro on FreeBSD:

                              It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                              Close but not quite. And we have several reasons to believe that it is really that hot. But false reporting is still a possibility.

                              1 Reply Last reply Reply Quote 0
                              • scottalanmillerS
                                scottalanmiller @thwr
                                last edited by

                                @thwr said in Overheating NICs in SuperMicro on FreeBSD:

                                @momurda said in Overheating NICs in SuperMicro on FreeBSD:

                                It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                                Most ICs should survive this. AMD had a range of GPUs reaching 95° - during regular use with no OC involved. But 115°C is a lot.

                                Broken sensors / reporting could be a reason, good point. @scottalanmiller: Got an infrared camera?

                                It's not holding that temp, only spiking to it once a day or less.

                                dafyreD thwrT 2 Replies Last reply Reply Quote 0
                                • dafyreD
                                  dafyre @scottalanmiller
                                  last edited by

                                  @scottalanmiller said in Overheating NICs in SuperMicro on FreeBSD:

                                  @thwr said in Overheating NICs in SuperMicro on FreeBSD:

                                  @momurda said in Overheating NICs in SuperMicro on FreeBSD:

                                  It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                                  Most ICs should survive this. AMD had a range of GPUs reaching 95° - during regular use with no OC involved. But 115°C is a lot.

                                  Broken sensors / reporting could be a reason, good point. @scottalanmiller: Got an infrared camera?

                                  It's not holding that temp, only spiking to it once a day or less.

                                  Any correlation between network traffic and the temperature spikes? Does it happen at the same time every day, etc, etc?

                                  scottalanmillerS 1 Reply Last reply Reply Quote 0
                                  • thwrT
                                    thwr @scottalanmiller
                                    last edited by

                                    @scottalanmiller said in Overheating NICs in SuperMicro on FreeBSD:

                                    @thwr said in Overheating NICs in SuperMicro on FreeBSD:

                                    @momurda said in Overheating NICs in SuperMicro on FreeBSD:

                                    It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                                    Most ICs should survive this. AMD had a range of GPUs reaching 95° - during regular use with no OC involved. But 115°C is a lot.

                                    Broken sensors / reporting could be a reason, good point. @scottalanmiller: Got an infrared camera?

                                    It's not holding that temp, only spiking to it once a day or less.

                                    Just talked to a friend who is much more into soldering etc than me. He said that 150°C is a temperature to look at, because the so-called https://en.wikipedia.org/wiki/Glass_transition might come into effect. As for the IC itself, there's a "https://en.wikipedia.org/wiki/Junction_temperature" to keep an eye on. Both are related more or less.

                                    1 Reply Last reply Reply Quote 0
                                    • scottalanmillerS
                                      scottalanmiller @dafyre
                                      last edited by

                                      @dafyre said in Overheating NICs in SuperMicro on FreeBSD:

                                      @scottalanmiller said in Overheating NICs in SuperMicro on FreeBSD:

                                      @thwr said in Overheating NICs in SuperMicro on FreeBSD:

                                      @momurda said in Overheating NICs in SuperMicro on FreeBSD:

                                      It could just be a problem with reporting, and not actually be that hot. I hope. Cuz 115 is melty time.

                                      Most ICs should survive this. AMD had a range of GPUs reaching 95° - during regular use with no OC involved. But 115°C is a lot.

                                      Broken sensors / reporting could be a reason, good point. @scottalanmiller: Got an infrared camera?

                                      It's not holding that temp, only spiking to it once a day or less.

                                      Any correlation between network traffic and the temperature spikes? Does it happen at the same time every day, etc, etc?

                                      Yes, appears to be loosely related.

                                      1 Reply Last reply Reply Quote 0
                                      • scottalanmillerS
                                        scottalanmiller
                                        last edited by

                                        Getting the entire motherboard replaced straight away.

                                        1 Reply Last reply Reply Quote 0
                                        • scottalanmillerS
                                          scottalanmiller
                                          last edited by

                                          Breaking the LAG to potentially reduce flips and load on the NICs. @Mike-Davis

                                          1 Reply Last reply Reply Quote 1
                                          • Mike DavisM
                                            Mike Davis
                                            last edited by

                                            It was amazing that Scott found it so fast. I was on the Windows side of things. Inside Windows they were using the iSCSI initiator to connect to the FreeNAS. All the sudden Windows would just log a ton of iSCSI events and go down.

                                            I looked up the events and most people resolved them by putting the iSCSI traffic on a separate NIC. This happened two days in a row at about the same time each day. I was looking at snapshot, backup, etc times when Scott found it in the FreeNAS logs.

                                            1 Reply Last reply Reply Quote 1
                                            • 1
                                            • 2
                                            • 1 / 2
                                            • First post
                                              Last post