Jump to content
Welcome to our new Citrix community!
  • 1

Servers randomly crashing


Question

We have randomly crashes of our provisioned RDS running Server 2016 (1607).

 

Environment:

 

Citrix XenDesktop 7.15 CU3

Server 2016 Build 1607 (24 vCPU, 90GB RAM, ~30-35 sessions per RDS)

Hyper-V 2016 (S2D) as Hypervisor

Citrix Provisioning Services 7.15 CU3

Citrix Workspace Environment 18.08 with enabled memory and cpu management

 

What we've done:

 

Eventlogs do not show any useable entry 

CDF Tracing on PVS does not show any entry on reboot-times

New installation of Server 2016 from scratch mit last MS Feb CU 

Only an entry in Hyper-V Clusterlog that showes, that Hyper-V has hard rebootet the VM:

(sorry only in german available)

 

""

Fehler in der Clusterressource "XXXXXXXXXXXX" des Typs "Virtual Machine" in der Clusterrolle "XXXXXXXX Resources".

Abhängig von den Fehlerrichtlinien für die Ressource und die Rolle wird vom Clusterdienst möglicherweise versucht, die Ressource auf diesem Knoten online zu schalten oder die Gruppe auf einen anderen Knoten des Clusters zu verschieben und die Ressource dann neu zu starten. Prüfen Sie den Ressourcen- und Gruppenzustand mit dem Failovercluster-Manager oder mit dem Windows PowerShell-Cmdlet "Get-ClusterResource".

""

 

Users reports that before crashing of the server, the session freezes (no keyboard input or mouse move) and then tries to reconnect without result.

 

We do not have any error-message in any log that can help us to find the issue.

 

The issue appears only on servers that have more than 30 sessions connected. Other servers with less then 10 sessions with same image are not effected.

 

Does someone report same issue with Server 2016 ???

Link to comment

4 answers to this question

Recommended Posts

  • 0

Now I've seen the issue on my own session:

 

While working, the session freezes, no mouse move, no keyboard input.

The screen goes blurred.

After round about 1 or 2 minutes the screen goes black and the sessions disconnects.

 

Looking in the Hyper-V logs, the VM was rebooted by Hyper-V.

 

Still no further entries in eventlogs of the VM and in CDF-Trace logs of Citrix Provisioning Servers.

 

I've found an article that describes that Hyper-V looks at every VM if the OS is reachable. If not, the VM will be restarted by Hyper-V
and logs it with the error (sorry still in german):

 

""

Fehler in der Clusterressource "XXXXXXXXXXXX" des Typs "Virtual Machine" in der Clusterrolle "XXXXXXXX Resources".

Abhängig von den Fehlerrichtlinien für die Ressource und die Rolle wird vom Clusterdienst möglicherweise versucht, die Ressource auf diesem Knoten online zu schalten oder die Gruppe auf einen anderen Knoten des Clusters zu verschieben und die Ressource dann neu zu starten. Prüfen Sie den Ressourcen- und Gruppenzustand mit dem Failovercluster-Manager oder mit dem Windows PowerShell-Cmdlet "Get-ClusterResource".

""

This article describes the result of Hyper-V:

https://blogs.msdn.microsoft.com/clustering/2013/01/24/understanding-how-failover-clustering-recovers-from-unresponsive-resources/

 

This happens only on VMs that has more than 20 sessions in peak logged on. And only when users logged off and near 10 sessions are still connected.

 

And to make it complicated, not at all. Only radomly and not every day :-(

 

On Citrix RDS with the same PVS-Image but in another DeliveryGroup with 10 sessions in peak, the issue has not happened at all.

 

Very strange ... what can I do further on ???

 

 

 

Link to comment

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...