Windows Vista Forums

Hyper-V cluster guests losing network access and/or hosts getting blue screen :-(
  1. #1


    Tiago Lock Martins Guest

    Hyper-V cluster guests losing network access and/or hosts getting blue screen :-(

    Hello,

    I'm having some issues in my Hyper-V environment.

    My environment:

    Windows Server 2008 R2 Enterprise Edition running on Dell PowerEdge R710
    with 32GB RAM.
    3-node cluster set up with a quorum disk; it has both default-type cluster
    disks and CSV disks.
    1 NIC dedicated to the host, 1 NIC set to trunk with two VLANs attached to
    virtual switch 2, 2 NICs in link aggregation attached to virtual switch 1.
    2 Fibre Channel HBAs attached to Brocade switches and Dell EMC CX300
    storage.
    Around 52 LUNs associated with the cluster.
    Some LUNs contain VHDs and others are raw disks.
    All guest NICs are synthetic.

    When I had a 2-node cluster, some guests would just lose their network
    connection. Even detaching the guest NIC from the virtual switch and
    attaching it back doesn't resolve the problem; I need to restart the guest
    to get it back on the network. Sometimes the guests just hang.

    After adding a 3rd node to the cluster, the hosts started to restart
    after blue screens (I got an Overlapped I/O message on one of them).
    Sometimes the host doesn't restart, but it simply restarts the guests
    running on it.

    I've been searching for a week already and haven't found anything that
    really helps.

    Does anyone have any clue?

    If you need more info, please tell me. I will be happy to help you to help
    me :-D



    Thanks for your time




  2. #2


    RCan Guest

    Re: Hyper-V cluster guests losing network access and/or hosts getting blue screen :-(

    Hi Tiago,

    First of all, did you change the quorum model when you added the 3rd
    node? To verify: in your case the quorum model should now be
    "node majority".
    http://technet.microsoft.com/en-us/l...irementsNandFS

    Can you also please share more details from your event logs and, more
    importantly, the cluster log, which should be generated (cluster log /gen)
    shortly after this happens?
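
    As a rough illustration of why the quorum model is worth re-checking after
    going from 2 to 3 nodes, here is a minimal Python sketch of the vote
    arithmetic. The function names are hypothetical and nothing here talks to a
    real cluster; the odd/even rule reflects the general Microsoft
    recommendation for Windows Server 2008 R2 failover clusters (Node Majority
    for an odd node count, Node and Disk Majority for an even one).

```python
def votes_needed(total_votes: int) -> int:
    """Smallest number of votes that is a strict majority."""
    if total_votes < 1:
        raise ValueError("a cluster needs at least one vote")
    return total_votes // 2 + 1


def tolerated_failures(total_votes: int) -> int:
    """How many votes can be lost while still keeping quorum."""
    return total_votes - votes_needed(total_votes)


def suggested_quorum_model(node_count: int) -> str:
    """Hypothetical helper: odd node counts -> Node Majority,
    even node counts -> Node and Disk Majority (disk witness adds a vote)."""
    if node_count < 1:
        raise ValueError("node_count must be positive")
    if node_count % 2 == 1:
        return "Node Majority"
    return "Node and Disk Majority"


if __name__ == "__main__":
    for nodes in (2, 3):
        model = suggested_quorum_model(nodes)
        # A disk witness contributes one extra vote in the even-node case.
        votes = nodes if model == "Node Majority" else nodes + 1
        print(nodes, model, "votes:", votes,
              "needed:", votes_needed(votes),
              "can lose:", tolerated_failures(votes))
```

    With 3 nodes and Node Majority there are 3 votes, 2 are needed, and one
    node can fail. Keeping the quorum disk as a fourth vote does not raise the
    number of failures the cluster can survive.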

    Regards
    Ramazan

  3. #3


    RCan Guest

    Re: Hyper-V cluster guests losing network access and/or hosts getting blue screen :-(

    +1


  4. #4


    Brad Bird(MVP) Guest

    Re: Hyper-V cluster guests losing network access and/or hosts getting blue screen :-(

    Hello Tiago,

    I am curious to know if the suggestions from RCan helped you resolve this.

    This sounds suspiciously like an issue I had at University of Ottawa. As a
    band-aid, I would connect to the VM console and disable/enable the NIC
    in the guest to reset the IP stack. This was on IBM servers.

    At the time, based on information from IBM, we thought this was due to
    Broadcom firmware, but the problem was never completely solved, and since
    I don't work there anymore, I don't know to this day if it was...

    I realize your scenario is not the same nor do you have the same hardware.
    I hope the band-aid helps save you time if it resolves your issue faster
    than a reboot...

    Do you have any network stats being monitored? This is where I would start
    looking...

  5. #5


    AndyS Guest

    Re: Hyper-V cluster guests losing network access and/or hosts get

    Hi folks

    This sounds like a problem that we're having, and we are using Broadcom NICs.
    We are running Hyper-V on Windows Server 2008 R2 on a Dell R610 and have
    Windows Server 2003 SP2 guests. The symptoms are that guests lose network
    connectivity randomly (once every week or so), and some perform so poorly
    after losing network connectivity that they have to be forcibly shut down
    rather than rebooted properly.

    From what information I can find, there seems to be a link with Broadcom
    adapters. Some suggest disabling 'IPv4 Large Send Offload' on the physical
    adapters on the host, which we have done; however, we still get servers
    falling over. Another suggestion was to disable 'IPv4 Large Send Offload' on
    the guest virtual adapters (inside the guest Windows Server 2003 OS), but
    this caused servers to fall over every few hours. The only errors I can find
    before the guests lose network connectivity are 'Event ID 5 - The miniport
    'Microsoft Virtual Machine Bus Network Adapter' hung.' followed by 'Event ID
    4 - The miniport 'Microsoft Virtual Machine Bus Network Adapter' reset.'
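
    For anyone wanting to correlate these two events with the outages, a small
    Python sketch along the following lines may help. It assumes the guest's
    System event log has already been exported and parsed into simple records;
    the `Event` type, the `find_hang_reset_pairs` name, and the 5-minute window
    are all hypothetical choices for illustration, not part of any Microsoft
    tool.

```python
from datetime import datetime, timedelta
from typing import Iterable, List, NamedTuple, Tuple


class Event(NamedTuple):
    """Hypothetical parsed event-log record."""
    time: datetime
    event_id: int
    message: str


ADAPTER = "Microsoft Virtual Machine Bus Network Adapter"


def find_hang_reset_pairs(
    events: Iterable[Event],
    max_gap: timedelta = timedelta(minutes=5),  # assumed window
) -> List[Tuple[Event, Event]]:
    """Return (hung, reset) pairs: an Event ID 5 for the synthetic adapter
    followed by an Event ID 4 for it within max_gap."""
    pairs: List[Tuple[Event, Event]] = []
    pending_hang = None
    for ev in sorted(events, key=lambda e: e.time):
        if ADAPTER not in ev.message:
            continue  # ignore events from other sources
        if ev.event_id == 5:
            pending_hang = ev  # remember the most recent hang
        elif ev.event_id == 4 and pending_hang is not None:
            if ev.time - pending_hang.time <= max_gap:
                pairs.append((pending_hang, ev))
            pending_hang = None
    return pairs
```

    Comparing the timestamps of the returned pairs with the times the guests
    dropped off the network would show whether every outage is preceded by
    this hung/reset sequence.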

    We have a call open with Microsoft regarding this issue but we haven't got
    very far.

    Cheers

    Andy

  6. #6


    Tiago Lock Martins Guest

    Re: Hyper-V cluster guests losing network access and/or hosts get

    Hi AndyS,

    > The only errors I can find before the guests lose network connectivity
    > are 'Event ID 5 - The miniport 'Microsoft Virtual Machine Bus Network
    > Adapter' hung.' followed by 'Event ID 4 - The miniport 'Microsoft
    > Virtual Machine Bus Network Adapter' reset.'

    Same thing on this side :-(

    Does anyone know why this happens (or what may cause this behavior)?

    Thanks for your time so far ppl.

  7. #7


    RCan Guest

    Re: Hyper-V cluster guests losing network access and/or hosts getting blue screen :-(

    Hi Tiago,

    Let me know the results from your cluster logs. In most cases this is
    related to network communication issues between nodes. Cross-check your
    network configuration against Microsoft best practices.

    Regards
    Ramazan

    "Tiago Lock Martins" <TLock@newsgroup> wrote in message
    news:6B64CCD1-5C0F-4172-9BF0-21A2D0F4BD16@newsgroup

    > Hi Brad,
    >
    > I will check the results of cluster log /gen when the issue happens again.
    >
    > Unfortunately, disabling and re-enabling the guest NIC doesn't restore
    > connectivity :-(
    >
    > Glad to hear other thoughts.
