Quantcast
Channel: The Official System Center Service Manager Blog
Viewing all 26 articles
Browse latest View live

Troubleshooting Service Manager work item (Incident, Change Request, Service Request) status stuck on “New”

$
0
0

Any time we see newly created work items stuck with a “New” status, it generally means that the Service Manager workflows are not processing or are processing slowly. Monitoring the “Minutes behind” of each workflow can be a useful method of troubleshooting.

The following web page has good troubleshooting tips and a SQL query that can be used to display the “Minutes behind” of each workflow. A similar SQL Server Management Studio SQL query is also shown below:

-- Use ServiceManager
-- Select Name, is_broker_enabled from sys.databases Where name = 'ServiceManager'
-- Line above added because it needs to be 1 or some stuff will not run.  Confirm is_broker_enabled set to 1
-- Select above is remarked out because it is not directly related to the purpose of this blog posting.
-- SubscriptionStatus.sql    -- Workflow / subscription status
Use ServiceManager
 DECLARE @MaxState INT, @MaxStateDate Datetime, @Delta INT, @Language nvarchar(3)
 SET @Delta = 0
 SET @Language = 'ENU'
 SET @MaxState = (
    SELECT MAX(EntityTransactionLogId)
    FROM EntityChangeLog WITH(NOLOCK)
 )
 SET @MaxStateDate = (
 SELECT TimeAdded
 FROM EntityTransactionLog
 WHERE EntityTransactionLogId = @MaxState
)
SELECT
    LT.LTValue AS 'Display Name',
  S.State AS 'Current Workflow Watermark',
 @MaxState AS 'Current Transaction Log Watermark',
 DATEDIFF(mi,(SELECT TimeAdded
     FROM EntityTransactionLog WITH(NOLOCK)
     WHERE EntityTransactionLogId = S.State), @MaxStateDate) AS 'Minutes Behind',
 S.EventCount,
 S.LastNonZeroEventCount,
 R.RuleName AS 'MP Rule Name',
    MT.TypeName AS 'Source Class Name',
    S.LastModified AS 'Rule Last Modified',
 S.IsPeriodicQueryEvent AS 'Is Periodic Query Subscription', --Note: 1 means it is a periodic query subscription
    R.RuleEnabled AS 'Rule Enabled', -- Note: 4 means the rule is enabled
 R.RuleID
 FROM CmdbInstanceSubscriptionState AS S WITH(NOLOCK)
 LEFT OUTER JOIN Rules AS R
    ON S.RuleId = R.RuleId
 LEFT OUTER JOIN ManagedType AS MT
    ON S.TypeId = MT.ManagedTypeId
 LEFT OUTER JOIN LocalizedText AS LT
 ON R.RuleId = LT.MPElementId
 WHERE
    S.State <= @MaxState - @Delta
 AND R.RuleEnabled <> 0
 AND LT.LTStringType = 1
 AND LT.LanguageCode = @Language
 AND S.IsPeriodicQueryEvent = 0
 /* to look at a specific workflow uncomment on of the following */
 -- AND LT.LTValue  LIKE '%Test%'
 -- AND S.RuleId='1D74409B-B2D9-8C45-6702-AB8C94AA0694'  -- aka Display Name="New Change Request Workflow"'
 ORDER BY S.State Asc   

Troubleshooting Workflow Performance and Delays

We run the above SQL query many times waiting a few minutes between each execution to see how the “Minutes Behind” for each workflow changes. We scroll to the bottom of the list to determine the number of workflows in the normal range between executions. 2 minutes or less is normal:

  • is the “Minutes Behind” static
  • is the “Minutes Behind” only static for a few workflows.   It may be that the workflow is disabled, or that there is a management pack override disabling the workflow even though it shows up as enabled, or possibly it is a custom workflow that is not working properly.
  • Is the “Minutes Behind” continuously increasing for all workflows or only some of the workflows
  • Are all the workflows are impacted (greater than 2 minutes behind)
  • Are the “Minutes Behind” continuously increasing or does it go down on occasion.

The solution in this blog is intended to be used when 98% or more of the workflow “Minutes Behind” are static or continuously increasing over time.   If the workflow “Minutes Behind” is up and down as you execute the SQL query over and over then the troubleshooting steps in the web link above Troubleshooting Workflow Performance and Delays is more appropriate. Below is the list of common issues and solutions that we see from time to time on the Microsoft support lines when 98% or more of the workflow “Minutes Behind” are static or continuously increasing over time:

LIST OF ISSUES / SOLUTIONS:

– Most of the time the issue is resolved in a single minute by stopping the System center services on the Primary Management server, deleting Health Service Statefolder, and then restarting the services.  

 There are probably several causes however the most common is SQL server was restarted and the Service Manager Services Timed out trying to reach the SQL server. The following PowerShell steps can be used to reduce the time it takes to stop the services, delete the subfolder “Health Service State” and restart the services. The best way to prevent this problem is to put in a process to stop Service Manager services before applying updates to the SQL server and/or any other time that the Service Manager SQL server is restarted. After the SQL Service has been up and running for 5 minutes then restart the Service Manager services.

## Ideal stopping order:

Stop-Service HealthService ; Stop-Service OMCFG; Stop-Service OMSDK

Get-Service HealthService,omcfg,omsdk;

 ## You can use the following to open the Service Manager folder

## From the Service Manager folder delete or rename the "Health Service State" subfolder

$SMFolder = (Get-ItemProperty "HKLM:\SOFTWARE\Microsoft\System Center\2010\Common\Setup").InstallDirectory

Start $SMFolder

 ## ideal starting order (reversed from stopping)

Start-Service OMSDK ; Start-Service OMCFG; Start-Service HealthService

Get-Service HealthService,omcfg,omsdk;(Get-date).ToString()

– The “Microsoft Monitoring Agent” in Control Panel should not have any management server listed on the Service Manager primary management server, or other Service Manager management servers. If you have a server listed in the “Microsoft Monitoring Agent Properties” it should be removed and the option “Automatically update management group assignments from AD DS” should be unchecked.

If you want to monitor the management server please review the following document:

Microsoft System Center Management Pack for System Center Service Manager

Under “Mandatory Configuration” page 6

“…You should also ensure that the Service Manager management servers are configured for agentless monitoring…”

I have seen customers use it. Sometimes it works for a long time and then comes the hair pulling. Do not be tempted. Running the SCOM agent locally will on rare occasions cause unexpected behavior.

The following items below are unlikely to help if the “Minutes Behind” is only for some workflows, or if the “Minutes Behind” for the workflows is going down and up.

It is normal to have workflows with 0, 1, or 2 minutes. If “Minutes Behind” is going down then there is likely a SQL Load issue as mentioned in Troubleshooting Workflow Performance and Delays. If the “Minutes Behind” is not changing or increasing over time please review the possible solutions below:

– The workflows only run from the Service Manager Primary Management Server.   Execute the following SQL Query again the ServiceManager Data Base and confirm the Primary Management server name.   Is the server up and running?

-- Display the primary management server

Use ServiceManager

select DisplayName, [PrincipalName] from [MTV_Computer]

where [BaseManagedEntityId]=

(SELECT ScopedInstanceId

FROM dbo.[ScopedInstanceTargetClass]

WHERE [ManagedTypeId] = dbo.fn_ManagedTypeId_MicrosoftSystemCenterWorkflowTarget()

)

– Are the Services running on the Primary Management server? From an elevated Powershell prompt:

PS C:\> Get-Service HealthService,omcfg,omsdk
Status   Name               DisplayName
-----   ----               -----------
Running HealthService     Microsoft Monitoring Agent
Running omcfg             System Center Management Configuration
Running omsdk             System Center Data Access Service

– The “HKLM\SOFTWARE\Microsoft\System Center\2010\Common\MOMBins\Value1” registry value is required to connect to SQL database.   Also the encryption key in Value1 must match the SQL server database that it was generated from and the Management servers FQDN name. Meaning the computer name of the Service Management server and the domain that it belongs to cannot be changed.

– Is the Primary Management server listed in the SQL Health Service table in SQL?

-- Display Service Manager Management servers
Use ServiceManager
Select * from MT_HealthService

Is the primary management server listed in the MT_HealthService? If no then the Primary Service Manager Management Server Windows computer management object was deleted. Rare however sometimes customer accidentally deleted Windows Computer object for the Management server using Powershell or via Service Manager Console, “All Windows Computers”. If deleted via the GUI it should still exist until the items are cleared from Service Manager Console “Deleted Items”.   If missing promote a secondary Service Manager Management server to a primary Management server.   If no management servers are present in MT_HealthService SQL table then Service Manager Database must be Restored and existing tickets have to be recreated. Attempting to restore just the MT_HealthService table will not work.   Microsoft Development team has confirmed that when the Service Manager management Windows computer object is deleted many other interrelated changes occur to the Service manager database requiring the ServiceManager database be restored.

– If the password has been changed even if it has been changed back, reset the password in the Service Manager Console to see if it corrects the workflow problem.

Reset / retype the password of the Service Manager Workflow account stored using the following steps:

Service Manager Console > Administration > Administration > Security > Run as Accounts

Then double click the account the and type in the password

– Service Account Authentication problem or SCSM workflow account authentication problem:

Log Name:      Operations Manager
Source:        HealthService
Event ID:      7000
Level:         Error
Description:
The Health Service could not log on the RunAs account CONTOSO\SvcMgrWork for management group ServiceMgmtGroup.  The error is The user name or password is incorrect.(1326L).  This will prevent the health service from monitoring or performing actions using this RunAs account.

Log Name:      Operations Manager
Source:        HealthService
Event ID:      7000
Level:         Error
Description:
The Health Service cannot verify the future validity of the RunAs account CONTOSO\SvcMgrWork for management group ServiceMgmtGroup.  The error is The user name or password is incorrect.(1326L).

The causes can vary. Account has been deleted from Active Directory, Password has Expired, Account is disabled , time is greater than 5 minutes between systems causing a Kerberos authentication failures.   From the Service Manager primary management server you can run the following from an elevated Powershell prompt against the system event log and it might confirm a Kerberos problem. You may need to re-enter SCSM workflow account under Service Manager Console > Administration > Administration > Security > User Roles.

Get-WinEvent -Logname system | ?{$_.Message -like "*KRB_AP_ERR_MODIFIED*"}

The following can be used on different systems to determine if the UTC time is near the 5 minute difference, replacing DomainControllerServerName with the name or your DC:

w32tm.exe /stripchart /computer:DomainControllerServerName

– Check if the PID of the HealthService service is changing often. This would indicate that the service is crashing and then restarting.

Lastly, if you workflows start running properly “Minutes Behind” at 0, 1, or 2 minutes then new workitems should work as expected.   In some cases previous workitems may need to have the status reset with Powershell.

Search keywords:

Workitem status not updating

Workitem stuck on new

Workitem status not changing

Incident status not changing

Service Request status not changing

Change Request status not changing

  • Austin Mack, Sr. Support Escalation Engineer, Microsoft

KB: Configuration Manager 2007 client operations fail after you install a May 2017 security update for Windows Server 2008 R2

$
0
0

We have released a new KB article Configuration Manager 2007 client operations fail after you install a May 2017 security update for Windows Server 2008 R2 which contains a solution to the following issue:

Client-related operations fail in an installation of Microsoft System Center Configuration Manager 2007 that has the server locator point (SLP) role after you install one of the following May 2017 security updates for Windows Server 2008:

4018556 Security update for the Windows COM Elevation of Privilege Vulnerability in Windows Server 2008: May 9, 2017

4019263 May 9, 2017—KB4019263 (Security-only update)

4019264 May 9, 2017—KB4019264 (Monthly Rollup)

Note This problem does not affect System Center Configuration Manager 2012 or the current branch version of the program.

This problem can affect the following operations:

  • New client registrations
  • Client assignments to new sites
  • Client reinstallations

Also, you receive a “Could Not Initialize” error message if you browse to the following location:

http://localhost/sms_slp/SLP.dll?site&SC=<sitecode>

Note In this message, <sitecode> represents your actual site code. This error message resembles the following screen shot:

Cause

The worker process typically runs under the LOCAL SERVICE account. However, after you apply one of the updates that are mentioned in the “Symptoms” section, the LOCAL SERVICE account is removed. This causes the worker process to be moved to the System account, and the SLP becomes inaccessible.


Workaround

The worker process typically runs under the LOCAL SERVICE account. However, after you apply one of the updates that are mentioned in the “Symptoms” section, the LOCAL SERVICE account is removed. This causes the worker process to be moved to the System account, and the SLP becomes inaccessible.

  1. Open the Properties window of the SLPExec.exe file. by default, this file is located in the following folder:
    c:\SMS\SMS_SLP
    Note If you don’t know where the SLPExec.exe file is located, go to IIS, browse to the default website, and then look under SMS_SLP and content view. Click View Permissions to see the full path.
  2. In the Group or user names area, add LOCAL SERVICE.
  3. Grant the Read & execute permission for LOCAL SERVICE

After you grant the permission, try again to access the URL that generated the error. If the XML information is displayed, the problem is temporarily resolved.

For the official KB article see https://support.microsoft.com/help/4035047.

System Center 2016 now supports TLS1.2 security protocols

$
0
0

TLS 1.2 is the secure way of communication suggested by Microsoft with best-in class encryption. SSL and early TLS are not considered strong cryptography and cannot be used as a security control. Microsoft has added official support for TLS1.2 security protocols in System Center 2016. This protocol is now supported in the following products:

  • System Center Operations Manager (SCOM)
  • System Center Virtual Machine Manager (SCVMM)
  • System Center Data Protection Manager (SCDPM)
  • System Center Orchestrator (SCO)
  • Service Management Automation (SMA)
  • Service Provider Foundation (SPF)
  • System Center Service Manager (SCSM)

The 3 step process of enabling TLS1.2 involves:

  1. Installing the updates for Windows Server Security, .NET 4.6, SQL Server & System Center 2016 UR4. Please note that for  Service Management Automation(SMA) and Service Provider Foundation(SPF) you need to upgrade to their most recent UR as UR4 does not have any updates to these components. For Service Management Automation (SMA) upgrade to Update Rollup 1 and for Service Provider Foundation (SPF) to Update Rollup 2, For SMA also update the SMA 2016 MP from here. System Center Virtual Machine Manager(SCVMM) should be upgraded at least to Update Rollup 3.
  2. Changing the configuration settings to enable TLS1.2 in the Windows Environment and System Center across all components.
  3. Making additional System Center component specific settings.

You can find more details in the System Center 2016 TLS1.2 Configuration article.

System Center Service Manager (SCSM) Authoring Management Pack tips and best practices

$
0
0

When using the SCSM Authoring Management Pack, follow these tips and best practices for best results. These are not always presented in order, so please read through this post before setting up your MP.

  • Disable the MPSSyncJob Schedule before importing custom Management Packs until you determine you like the results of your custom changes on the SM side.
  • Extend an existing class versus creating a new class.  Extending an existing class will make it more compatible with 3rd party  software
  • Create only one .SNK key file.  If you create multiple .SNK files you will have to keep track of which .SNK file was used on which Management  pack.
  • Keep your .SNK key with your custom management packs in a safe place.  You MUST have the .SNK key to update any sealed Management pack
  • Cannot import sealed management pack over  a unsealed Management pack.  “Cannot import sealed management pack {0}.  The database already contains an unsealed management pack”
  • The file name of the management pack matters.  It must match the name in the Management Pack.
  • Internal property names are not allowed to have spaces
  • Unable to SEAL management pack XML that was previously sealed. “The Target attribute value is not valid.  Element…”  remove the ManagementPackPublicKeyToken line from the XML file.    You will get a similar error if you export a sealed MP and then attempt to import the XML directly into Service Manager.
  • If you remove a management pack that defines classes / class properties all data associated with the class will be removed from Service Manager.  Does not Impact Data Warehouse.  Data Warehouse is designed to be a repository.  That also means the properties on DW are still defined.  This is why you need to confirm all is well before Mpsynjob runs.
  • If problems occur with a new management pack the simplest step is to remove it provided it does not contain fields/properties.  See previous bullet item.
  • SLOW opening a custom form modified by Service Manager Authoring tool.  To fix use: SCSM Form Customization Cleanup Script https://gallery.technet.microsoft.com/SCSM-Form-Customization-8ebe8dd7
  • Strongly recommended that forms and classes are not stored in the same management pack.
  • Service Manager SLOs must be created via the console.   During the wizard it will prompt for a management pack.  Ensure that you specify a management that only contains SLOs, do not store other objects in the MP.  SLOs get messed up when importing via Management Pack.  The entire MP often has to be removed to re-create SLO via the Console.
  • MPSyncJob will only import Sealed MPs into Data Warehouse.  Exception to this is: List items (aka “enumerations“) and groups/queues will be sync’d to the DW regardless of whether they are in a sealed or unsealed MP
  • After a sealed Management Pack is imported the following cannot be performed without removing the Management Pack
    •     Remove Classes
    •     Remove Properties
    •     Changing the Property type – you will see an error message similar to this:
"The property Type (PropertyXYZ) has a value that is not upgrade compatible. OldValue=enum, NewValue=string"
  • If you do remove the sealed management pack and it was already been imported into Data Warehouse the fields defined will remain on the Data Warehouse side if MPSyncJob already ran.  This goes back to the first tip ‘disable MPSyncJob’ until you are certain everything is working and the fields as spelled correctly.
  • Always test Management packs in a test environment before testing in the production environment.  Keep backups of previous versions of Management Packs so that you can go back to a working state.
  • An unsealed management pack cannot be referenced by another management pack (https://msdn.microsoft.com/en-us/library/hh964841.aspx)
  • All management packs published by Microsoft will have the public key token 31bf3856ad364e35
  • SCSM 2012 R2 MP import allows a string property length to be increased.  Although DWJob may fail until MPSyncjob completes to move the changes to the DW side.
  • Management pack Boolean True will not work must be true (lower case)
  • Sealing a management pack changes the GUID identity of the management pack and therefore the GUID identity all of the management pack elements inside of the management pack change.  Changing the name of the MP will do the same thing.
  • NEVER create a string larger than 4000 bytes if you plan to use Data Warehouse…  In general you should only make strings as big as you need.  It is easy to increase the size however decreasing the size is an entirely different story.

-Austin Mack

Microsoft Support

In case you missed it! The latest System Center release, version 1801, is here.

$
0
0

In February we announced that System Center, version 1801 was now available.  It’s the first release in our new Semi-Annual Channel and delivers new features and enhancements based on customer feedback. It builds on the capabilities of System Center 2016 and has support for the latest version of Windows Server, version 1709 as well as Windows Server 2016. It includes enhanced Linux monitoring support, more efficient VMware backup, and improved user experience and performance. For more details, read the full announcement..

Update Rollup 5 for System Center 2016 Service Manager is released

$
0
0
Update Rollup 5 for System Center 2016 Service Manager is now available. See the article Update Rollup 5 for System Center 2016 Service Manager for a description of issues that are fixed and the improvements that are included. This article also contains the installation instructions for this update.... Read more
Viewing all 26 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>