Disabling Hardware sensor - Bias current sensors

June 24, 2018, 7:10 am

Hi Guys,

I'm new to SolarWinds Orion and need your help here.

I'm trying to organize our alerts console and clean out unnecessary/buggy alerts.

I stumbled upon the Cisco bias current sensor alerts. From digging through the forums, I found that changing the default MIB should solve things. It worked, but not for all devices.

For the rest, I wanted to disable the hardware sensors to eliminate these buggy alerts.

The strange thing is that when I disable the alerting sensors, the alerts go away and everything is good, but after 10-15 minutes they are re-enabled by themselves.

How can I handle this mystery?

Your help is much appreciated.

Thanks,

Alex.

↧

Monitor a specific URL - failing

June 25, 2018, 3:21 am

I am trying to set up URL monitoring in NPM. Some are external URLs and some are internal URLs.

Following the instructions for Monitor a specific URL > Add Node > Polling = External Node > choose Web Link > check the box 'inherit credentials from template' > Test

The test fails: 'testing on node wmsmeet.iscmotorsports.com failed with Down status, the underlying connection was closed: an unexpected error occurred on a send'

What is this URL monitoring trying to do? How does it monitor? What template are the credentials applied to? What credentials does it need?
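
For anyone wondering what a URL check does in general: at its simplest, it sends an HTTP(S) request and reports Up or Down depending on whether a usable response comes back. The sketch below is a minimal illustration in Python and is not how NPM implements its monitor; the function name check_url and the User-Agent string are made up for the example. A failure "on a send" often happens before any HTTP response arrives (for example during the TLS handshake), which is the kind of case the URLError branch reports.

```python
import http.client
import ssl
import urllib.error
import urllib.request


def check_url(url, timeout=10.0):
    """Send one GET request and return ("Up" or "Down", detail).

    Hypothetical helper for illustration only; it connects directly and
    ignores any proxy settings in the environment.
    """
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    request = urllib.request.Request(url, headers={"User-Agent": "url-check-sketch"})
    try:
        with opener.open(request, timeout=timeout) as response:
            return "Up", f"HTTP {response.status}"
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status (4xx or 5xx).
        return "Down", f"HTTP {exc.code}"
    except urllib.error.URLError as exc:
        # No HTTP answer at all: DNS, refused connection, TLS handshake, ...
        reason = exc.reason
        kind = "TLS failure" if isinstance(reason, ssl.SSLError) else "connection failure"
        return "Down", f"{kind}: {reason}"
    except (OSError, http.client.HTTPException) as exc:
        # Timeouts or a connection dropped while reading the response.
        return "Down", f"connection failure: {exc}"
```

Calling check_url("https://wmsmeet.iscmotorsports.com/") from the polling server would show whether the failure is a TLS problem, a refused connection, or an HTTP error status, which narrows down whether credentials are involved at all.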

↧

NPM Server specs/issues

June 25, 2018, 6:31 am

We have SolarWinds running on three virtual machines, plus a physical server for SQL. (They are all Windows Server 2012 R2.)

The VMs are a primary poller, an additional poller and the NTA storage server.

The 3 VMs are built identically: 8 CPUs, 32 GB RAM, a 60 GB C: drive and a 100 GB D: drive (application space and swap file).

The issue we have is that NPM diagnostics often throw up issues, and NPM ranges from very slow to unusable.

Can anyone give me a definitive answer as to how much memory and CPU should be allocated to a VM?

↧

Device not showing up for IP SLA tracker

June 25, 2018, 10:03 am

We have a couple of 3850-Xs that we recently added. I have IP SLA operations configured on them (via CLI) that I would like to monitor in the IP SLA module. I have other devices set up, but I do not see these new 3850s on the list of devices to select from.

I have those devices under read/write SNMPv3, with NCM set up as well, all working great.

Am I missing a step to add them to the IP SLA monitoring?

Thanks!
Sneha

↧

SolarWinds Trap Service stops and starts randomly.

June 25, 2018, 1:32 pm

Hello All,

I just started working with SolarWinds and I was asked to look into an issue with the SolarWinds Trap Service stopping and starting randomly. The environment currently looks like this: Orion Platform 2017.1.3 SP3, NPM 12.1, QoE 2.3, VIM 7.0.0, NetPath 1.1.0. There are a few KB articles regarding this. They suggest that I check the Trap Service and make sure that it's the only service that is using port 161. I've verified that it is.

I'm currently looking into the TrapService.log file and this is what I'm seeing.

C:\ProgramData\Solarwinds\Logs\Orion\TrapService.log.1

*** Assembly SolarWinds.Net.SNMP, Version=0.0.0.0, Culture=neutral, PublicKeyToken=null, .NET version v4.0.30319 ***

*** Assembly SolarWinds.Common, Version=0.0.0.0, Culture=neutral, PublicKeyToken=null, .NET version v4.0.30319 ***

*** Assembly SolarWinds.Orion.Core.Collector.MessageSender, Version=2017.1.5300.1698, Culture=neutral, PublicKeyToken=null, .NET version v4.0.30319 ***

2018-06-25 12:57:35,384 [1166] ERROR Main - SendRequest Error: Exception of type 'System.OutOfMemoryException' was thrown.

2018-06-25 12:57:35,384 [1166] ERROR Main - SNMPManager::InternalQuery() unable to send request

2018-06-25 12:57:35,384 [206] ERROR Main - SendRequest Error: Exception of type 'System.OutOfMemoryException' was thrown.

2018-06-25 12:57:35,384 [206] ERROR Main - SNMPManager::InternalQuery() unable to send request

2018-06-25 12:57:35,384 [327] ERROR TrapService.OID - Error retrieving OID!

System.Data.OleDb.OleDbException (0x80004005): System resource exceeded.

at System.Data.OleDb.OleDbCommand.ExecuteCommandTextErrorHandling(OleDbHResult hr)

at System.Data.OleDb.OleDbCommand.ExecuteCommandTextForSingleResult(tagDBPARAMS dbParams, Object& executeResult)

at System.Data.OleDb.OleDbCommand.ExecuteCommandTextForSingleRow(tagDBPARAMS dbParams, Object& executeResult)

at System.Data.OleDb.OleDbCommand.ExecuteCommandText(Object& executeResult)

at System.Data.OleDb.OleDbCommand.ExecuteCommand(CommandBehavior behavior, Object& executeResult)

at System.Data.OleDb.OleDbCommand.ExecuteReaderInternal(CommandBehavior behavior, String method)

at System.Data.OleDb.OleDbCommand.ExecuteReader(CommandBehavior behavior)

at TrapService.MibDbHelper.GetOIDFromDB(String oidString, String oriOid)

at TrapService.OID.RetrieveOID(String oidString)

The Windows logs also show an error regarding the TrapService...

APPLICATION LOG

Faulting application name: SWTrapService.exe, version: 2017.1.5300.1698, time stamp: 0x58ac46a9

Faulting module name: unknown, version: 0.0.0.0, time stamp: 0x00000000

Exception code: 0xc0000005

Fault offset: 0x00e6f2cf

Faulting process id: 0x5cfc

Faulting application start time: 0x01d40cc0fc09615d

Faulting application path: C:\Program Files (x86)\SolarWinds\Orion\SWTrapService.exe

Faulting module path: unknown

Report Id: c7152603-78b4-11e8-90a6-005056b530a1

SYSTEM LOG

The SolarWinds Trap Service service terminated unexpectedly. It has done this 300 time(s). The following corrective action will be taken in 60000 milliseconds: Restart the service.

Rebooting the server as well as manually stopping and starting the service hasn't resolved the issue.

I already have a case open with SolarWinds support; however, I've heard great things about this community and I wanted to reach out here and start to become an active member.

Thanks in advance for any insight.

P.S. This is my first time posting and I'm not sure if I've posted in the correct place. If I haven't, please advise where to post. Thanks again.
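
On the port check mentioned above: SNMP traps are conventionally received on UDP port 162, while 161 is the port devices listen on for SNMP queries, so it may be worth double-checking which port the KB article means. A quick way to see whether some process already holds a UDP port is to try binding it while the service is stopped. The following is a minimal sketch; the helper name udp_port_is_free is made up for the example.

```python
import socket


def udp_port_is_free(port, host="0.0.0.0"):
    """Return True if a UDP socket can be bound to (host, port) right now.

    False means the bind failed: usually another process holds the port,
    but a lack of privileges for low port numbers looks the same here.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as probe:
        try:
            probe.bind((host, port))
        except OSError:
            return False
        return True


if __name__ == "__main__":
    for port in (161, 162):
        state = "free" if udp_port_is_free(port) else "in use or not permitted"
        print(f"UDP {port}: {state}")
```

Run it with the Trap Service stopped. If a port still shows as unavailable from an elevated prompt, something else is bound to it.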

↧

Is it possible to load UNIX load average from a script (not SNMP)

June 25, 2018, 1:54 pm

We have agents running on Linux machines, where the load average (processes waiting for CPU) shows up with history charts.

We don't have agents pushed out to AIX yet (a couple are in test), nor is SNMP fully scaled out to read these metrics. I have written Perl scripts that read data from CSV files created by NMON agents (like top, but smarter). That data can also be viewed in a similar way.

Is there a way to feed this data to the same place the Linux/SNMP data go, so the load average shows up looking the same?
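
One possible route, assuming SAM is available, is a script monitor: as far as I know, these expect the script to print lines of the form "Statistic.Name: value" and "Message.Name: text". The sketch below shows that shape in Python rather than Perl. The CSV layout (a header row with a load1 column) and both function names are assumptions for illustration and do not reflect the real NMON file format. Whether the result can be made to look the same as the agent charts is not something this sketch settles.

```python
import csv
import io


def latest_load(csv_text, column="load1"):
    """Return the value of `column` in the last data row, as a float.

    Assumes a CSV with a header row; the column name is hypothetical.
    """
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    if not rows:
        raise ValueError("no data rows in CSV")
    return float(rows[-1][column])


def format_for_script_monitor(name, value, message):
    """Build the two output lines a script monitor is assumed to parse."""
    return f"Statistic.{name}: {value:.2f}\nMessage.{name}: {message}"


if __name__ == "__main__":
    sample = "timestamp,load1\n2018-06-25 13:00,0.75\n2018-06-25 13:01,1.50\n"
    load = latest_load(sample)
    print(format_for_script_monitor("Load1", load, "1-minute load average"))
```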

↧

I am getting an error on additional poller, what can be the reason for the same?

June 25, 2018, 5:53 pm

Hi,

I am getting an error on one of the additional pollers. What could be the reason?

There was an error updating the Engine Keep alive record

Error Detail-System.Data.SqlClient.SqlException (0x80131904): A network-related or instance-specific error occurred while establishing a connection to SQL Server. The server was not found or was not accessible. Verify that the instance name is correct and that SQL Server is configured to allow remote connections. (provider: Named Pipes Provider, error: 40 - Could not open a connection to SQL Server) ---> System.ComponentModel.Win32Exception (0x80004005): Access is denied

at System.Data.SqlClient.SqlInternalConnectionTds..ctor(DbConnectionPoolIdentity identity, SqlConnectionString connectionOptions, SqlCredential credential, Object providerInfo, String newPassword, SecureString newSecurePassword, Boolean redirectedUserInstance, SqlConnectionString userConnectionOptions, SessionData reconnectSessionData, DbConnectionPool pool, String accessToken, Boolean applyTransientFaultHandling)

at System.Data.SqlClient.SqlConnectionFactory.CreateConnection(DbConnectionOptions options, DbConnectionPoolKey poolKey, Object poolGroupProviderInfo, DbConnectionPool pool, DbConnection owningConnection, DbConnectionOptions userOptions)

at System.Data.ProviderBase.DbConnectionFactory.CreatePooledConnection(DbConnectionPool pool, DbConnection owningObject, DbConnectionOptions options, DbConnectionPoolKey poolKey, DbConnectionOptions userOptions)

at System.Data.ProviderBase.DbConnectionPool.CreateObject(DbConnection owningObject, DbConnectionOptions userOptions, DbConnectionInternal oldConnection)

at System.Data.ProviderBase.DbConnectionPool.UserCreateRequest(DbConnection owningObject, DbConnectionOptions userOptions, DbConnectionInternal oldConnection)

at System.Data.ProviderBase.DbConnectionPool.TryGetConnection(DbConnection owningObject, UInt32 waitForMultipleObjectsTimeout, Boolean allowCreate, Boolean onlyOneCheckConnection, DbConnectionOptions userOptions, DbConnectionInternal& connection)

at System.Data.ProviderBase.DbConnectionPool.TryGetConnection(DbConnection owningObject, TaskCompletionSource`1 retry, DbConnectionOptions userOptions, DbConnectionInternal& connection)

at System.Data.ProviderBase.DbConnectionFactory.TryGetConnection(DbConnection owningConnection, TaskCompletionSource`1 retry, DbConnectionOptions userOptions, DbConnectionInternal oldConnection, DbConnectionInternal& connection)

at System.Data.ProviderBase.DbConnectionInternal.TryOpenConnectionInternal(DbConnection outerConnection, DbConnectionFactory connectionFactory, TaskCompletionSource`1 retry, DbConnectionOptions userOptions)

at System.Data.ProviderBase.DbConnectionClosed.TryOpenConnection(DbConnection outerConnection, DbConnectionFactory connectionFactory, TaskCompletionSource`1 retry, DbConnectionOptions userOptions)

at System.Data.SqlClient.SqlConnection.TryOpenInner(TaskCompletionSource`1 retry)

at System.Data.SqlClient.SqlConnection.TryOpen(TaskCompletionSource`1 retry)

at System.Data.SqlClient.SqlConnection.Open()

at SolarWinds.Orion.Common.DatabaseFunctions.InnerCreateConnection(IsolationLevel isolationLevel, Boolean throwException, String customConnectionString, Credential credentials, Boolean useCurrentlyLoggedInUser)

at SyslogService.SyslogService.WriteKeepAlive(Object state)

ClientConnectionId:00000000-0000-0000-0000-000000000000

Error Number:5,State:0,Class:20

↧

Cisco ASA goes down on Failover

June 26, 2018, 3:30 am

Hi,

We have an NPM 12.3 HA environment.

While testing HA failover, we noticed that some of the Cisco ASA devices go down.

The network parameters have been validated between the devices and the secondary server.

If anybody has faced similar issues, please help in resolving this.

Thanks and regards,

Richa Arya

↧

Downloaded the 12.3 offline installer (2.9Gb) and the installation won't continue because there is no internet connection

June 26, 2018, 3:33 am

I downloaded the 12.3 offline installer (2.9 GB) and the installation won't continue because there is no internet connection. During the initial test phase it comes back with a couple of recommendations, which are all passable bar the message 'No Internet Connection Detected', and it suggests that I download the offline installer. Can anyone let me know what I am doing wrong here?

↧

Any way to have NPM (or Windows) mark DSCP values on its ping packets?

June 26, 2018, 6:17 am

Anyone,

Our Cisco WAN has a pretty complex QoS configuration that marks all traffic with one of several DSCP values, with a guaranteed bandwidth per class. The pings initiated by NPM for device up/down end up in a low-priority class that we know has occasional drops. As a result, I get false positives on devices being down. I've used Windows Group Policy in the past to mark UDP or TCP traffic (I'm using it for SNMP with NPM right now), but it doesn't touch ICMP packets. Does anyone know of a way within Windows or NPM itself to mark ICMP packets with DSCP values other than 0?

Thanks,

Chuck

↧

Unexpected Website Error Exception of type 'SolarWinds.ApiProxyFactory.ApiProxyException' was thrown.

June 26, 2018, 6:40 am

Hi,

I am running NPM 12.0.1. I'm not sure what has changed or may have caused this issue, but I am unable to add nodes for NPM to monitor. Every time I try to add a node, I get to the point where I choose the resources I want to monitor, and then I select Next and get the following error.

Unexpected Website Error

Exception of type 'SolarWinds.ApiProxyFactory.ApiProxyException' was thrown.

I have a case open for it but no resolution as of yet. Has anyone else run into this issue?

Thanks in advance,

Sean

↧

Custom Netflow Monitoring

June 26, 2018, 6:44 am

I am very new to SolarWinds, but have been given the task of creating dashboards for individuals based on the needs, wants, etc. of their specific jobs. I am trying to find a resource that allows NetFlow monitoring based on specific criteria that I give the resource, without having to go to the NetFlow page and filter from there. I have found the resources that are already in SolarWinds, but I don't see a way to edit them for my needs. Is the best option here to create a custom resource, or is there a way to edit the current resources to fit the needs that I have? Thanks in advance for the help!

↧

Looking for an Alert owner (who has created and when?) with SQL Query

June 26, 2018, 6:48 am

Hi,

I am looking for a SQL query that returns the alert owner (who created an alert, and when). Please also let me know whether configured alerts are saved somewhere on the SolarWinds server, the way reports are (C:\Program Files (x86)\SolarWinds\Orion\Reports).

↧

Why is 8 greater than 35 in Orion?

June 26, 2018, 7:09 am

I have a node which has triggered a temperature alert (and the accompanying reset) the last few nights, with no evidence of such an event if I look at the logs on the device itself. This alarm/action is set up on several dozen nodes, but this only seemed to be happening on one of them. Then I realized that this node was the only one going below 10 degrees. I have a query set up on one of my views that sorts the temperatures in descending order, and lo and behold, the node with an 8 degree temperature is at the top of the list, above all of the nodes in the 20s.

This is annoying, but what is unacceptable is that alerts are being triggered on this same math.

I'm assuming the query view is sorting alphabetically rather than comparing numerical values. However, why would the alert itself do that?

Is there something wrong in my alert?
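
The behaviour described is what happens whenever numbers are compared as text: characters are compared left to right, so "8" beats "35" because "8" comes after "3". A small Python illustration of the general effect (it says nothing about how Orion stores the value internally):

```python
readings = ["35", "8", "22", "27"]

# Compared as text: character by character, so "8" sorts above "35".
as_text = sorted(readings, reverse=True)

# Compared as numbers: convert before comparing.
as_numbers = sorted(readings, key=float, reverse=True)

print(as_text)      # ['8', '35', '27', '22']
print(as_numbers)   # ['35', '27', '22', '8']
print("8" > "35")   # True
print(8 > 35)       # False
```

If the temperature is held in a text field, both the sorted view and an alert condition such as "greater than 35" would show this effect unless the value is converted to a number first.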

↧

Question on NPM graph report

June 26, 2018, 9:58 am

On the NPM interface graph report, is there a way to show what the usage has been for a specific host? I tried NetFlow, but it is not what I am looking for.

Let's say the bandwidth utilization on the graph is 50 Mbps out of a 100 Mbps circuit. I want to generate a report that details how much of that bandwidth was used by a specific user, so I can see a trend of what percentage of the bandwidth this user is taking and know what the sustained requirements are.

↧

Question about disk performance.

June 26, 2018, 10:02 am

Hi all,

Could anyone out there tell me how you have your environment set up to provide the best possible performance?

1. We are all virtual.

2. We are on SAN storage.

3. We are a decently large environment with a pretty large database and several additional pollers.

We believe the problem to be related to SAN or disk configuration. But as you might have guessed, the storage and Windows teams say otherwise. What is the best way to set up virtual disks that reside on a SAN for optimal performance? Do they need to be dedicated to the database? Do they need to be tier 0 or higher on tiered storage? How can they be optimized for speed rather than capacity? And what numbers should we be looking at?

Any help with parameters and suggestions would be greatly appreciated.

Thanks,

leandro

↧

SQL errors in the logs.

June 26, 2018, 10:04 am

Hi all,

Question: I'm seeing a lot of errors saying that an item with the same key already exists, or that a primary key couldn't be updated due to duplicates, and things of this nature. What's the best way to find the cause and get it to stop?

Here is an example:

Violation of PRIMARY KEY constraint 'PK__#DownTim__913A95523B22EC15'. Cannot insert duplicate key in object 'dbo.#DownTimeEntitiesToMerge'. The duplicate key value is (Jun 26 2018 3:31PM, 10645, Orion.ADM.NodeInventory).

Thanks,

leandro

↧

Question for the community... Need your help with some issues I'm having.

June 26, 2018, 1:34 pm

Hi all,

I have a perfect storm of issues that I've been weathering for nearly 2 years now with no resolution. I was wondering if anyone has had these issues and could share some tips that might help guide me in the right direction. Between an unstable environment and early morning calls telling me the environment is down, I have been living in stress and haven't been able to sleep. I have several tickets open with support, but they are still unresolved.

1. Duplicates and triplicates in the environment. What I mean by this is, for example, I'll have one device three times with three different IPs. Or the other way around: 3 devices added 3 times with three separate IPs. I still haven't found a way to pull this into a report so I can go fix these devices.

2. Monitoring for SNMP and WMI failures. It seems like creating a SAM template would be the best way to go. Can anyone confirm? Simply put, what I'm trying to do is create a way for SolarWinds to send me an email when a device stops polling via SNMP or WMI.

3. Overloaded SAM. We have close to 300 SQL servers in AppInsight for SQL, with anywhere from 2 to over 50 databases per server. That easily overloaded SAM in component count. What's a more efficient way to monitor SQL? Suggestions welcome.

4. Performance issues. This seems related to disk performance, but I have no way to figure out what the root cause is.

5. Data integrity in the database. I don't know how to run integrity checks or how to make sure I don't have corruption happening.

6. Pollers all hanging due to the collector and business layer peaking CPU and RAM.

These are the top six pressing issues. Any help welcome.
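
For item 1, the report logic itself is simple once the node list is exported: group the nodes by name and keep the names that appear with more than one IP address. The following is a minimal sketch in Python. The field names caption and ip and the function name are assumptions for illustration; a real export would need its own column names mapped in.

```python
from collections import defaultdict


def find_duplicate_nodes(nodes):
    """Return {normalized name: sorted list of IPs} for names with more than one IP.

    `nodes` is an iterable of dicts with hypothetical keys "caption" and "ip".
    Names are compared case-insensitively with surrounding spaces ignored.
    """
    ips_by_name = defaultdict(set)
    for node in nodes:
        ips_by_name[node["caption"].strip().lower()].add(node["ip"])
    return {name: sorted(ips) for name, ips in ips_by_name.items() if len(ips) > 1}


if __name__ == "__main__":
    exported = [
        {"caption": "core-sw01", "ip": "10.0.0.1"},
        {"caption": "CORE-SW01", "ip": "10.0.1.1"},
        {"caption": "edge-rtr02", "ip": "10.0.2.1"},
    ]
    for name, ips in find_duplicate_nodes(exported).items():
        print(name, ips)
```

The same grouping could be done on a hardware serial number or system name instead of the caption if those are available in the export, which would also catch devices that were added under different names.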

↧

Agent monitoring brings me endless headaches.

June 26, 2018, 1:46 pm

Pain points:

1. Agents causing certain monitored servers to spike in CPU, others in RAM, and some in both. We are on NPM version 12.2. I can't for the life of me figure out why.

2. Agents causing problems with pollers: job engine spiking, collectors crashing, and ephemeral port spikes. How many agents per poller can the pollers handle? Am I overloading my system?

3. On my DMZ servers I can't get anything to work, least of all agents. Even when they are manually installed, I can't get them to communicate with the host. Should I place a poller in the DMZ to make this happen?

4. When moving devices from poller to poller, I have to go into Manage Agents and move the devices to the other poller manually.

5. 2003 servers are a pain to monitor no matter which way you choose. Even with agents they still like to be problem children. I'm having trouble figuring out a good way to monitor these servers. Is the agent the way to go?

Just the top 5 pressing issues. Help would be appreciated, thanks.

↧

Help to know how Solarwinds count/calculate the ping response for both SNMP and ICMP?

June 26, 2018, 2:07 pm

Hi,

I want to know how SolarWinds calculates the ping response for both SNMP and ICMP.

↧