22 Replies Latest reply on Jan 8, 2014 12:07 AM by aigidm

    LEM agent question

    evanr

      Does the spop.conf pull its info directly from a file on the LEM box?  For some reason, when installing the agent on a brand new machine, the spop.conf is populated with the old appliance IP address.  When LEM was first installed it was using .164 as its IP.  We have since moved the VM to a beefier box and redeployed it with an IP of .167.  I can ping the DNS name just fine, and have verified there are no stale records showing the old .164 address.  I can also telnet to ports 37890, 37891, and 37892 via the new IP.  Ultimately I can edit the spop.conf and manually change the IP to .167, but it's very annoying.  The user guide mentions clearing the agent certificate, but so far I have not found any in the local cert stores.  (Edit: This is done by deleting the spop folder within the ContegoSPOP folder.)  I can also manually edit the .conf file and use the DNS name instead without issues.  But where is it getting the address for its initial request?  For example: NioComNetworkParent Making install request to: x.x.x.164
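For anyone who wants to script the telnet check described above, here is a minimal sketch. It only confirms that a TCP connection can be opened to each port; it says nothing about whether the agent protocol works on top of it. The function name `check_ports` and the host name in the usage note are made up for illustration; the port numbers are the ones from the post.

```python
import socket


def check_ports(host, ports, timeout=5.0):
    """Return {port: True/False} for whether a TCP connection could be opened."""
    results = {}
    for port in ports:
        try:
            # create_connection resolves the name and performs the TCP handshake.
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            # Covers refused connections, timeouts, and name-resolution failures.
            results[port] = False
    return results
```

Usage would look something like `check_ports("lem.example.local", [37890, 37891, 37892])`, with the host replaced by the appliance's real DNS name or IP.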

        • Re: LEM agent question
          rk7708

          I am new to LEM. This morning I was with a lady from SolarWinds who told me that the SPOP folder contains certificates.

          So, for example, if I push an agent to computer A and that computer's IP or name then changes for some reason, it shows as disconnected. We then have to stop the SolarWinds agent service on that PC and delete the spop folder. After that, start the service again and a fresh spop folder will be created. Re-push the agent to that computer and it will show as connected. So SPOP actually communicates with the LEM manager for certificate purposes.

           

           

          This product is so complicated and confusing to work with though.

            • Re: LEM agent question
              evanr

              Do you know where it pulls the initial info from though?  These are brand new installs, but for whatever reason the .conf file is populated with the old IP.  Like you mentioned, I have been able to stop the service, edit the .conf file, delete the spop folder, and restart the service, and all is good.  But I would rather not have to do that on 20 machines.  I'm just wondering why it's populating with the old information in the first place.
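Since repeating the manual fix on 20 machines is the pain point, the file-level part of it can be sketched in a few lines. This is only an illustration under assumptions: the thread does not show the layout of spop.conf, so the sketch does a plain whole-address text substitution rather than parsing any keys, and the paths are passed in by the caller. The names `repoint_conf` and `clear_spop_folder` are hypothetical. The agent service still has to be stopped before and started after, as described above.

```python
import re
import shutil
from pathlib import Path


def repoint_conf(conf_path, old_addr, new_addr):
    """Replace every whole occurrence of old_addr in the file; return the count."""
    path = Path(conf_path)
    text = path.read_text()
    # Digit guards keep .164 from matching inside a longer number such as .1640.
    pattern = re.compile(r"(?<!\d)" + re.escape(old_addr) + r"(?!\d)")
    new_text, count = pattern.subn(lambda m: new_addr, text)
    if count:
        path.write_text(new_text)
    return count


def clear_spop_folder(agent_dir):
    """Delete the 'spop' subfolder of the agent directory if it exists."""
    spop = Path(agent_dir) / "spop"
    if spop.is_dir():
        shutil.rmtree(spop)
        return True
    return False
```

The return values make it easy to log which machines actually had the stale address, which may help narrow down where it is coming from.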

                • Re: LEM agent question
                  rk7708

                  Sorry, I wouldn't know. I just started using this app and it's already giving me a hard time. But I would check non-agent nodes and disconnected nodes to see if there are any duplicate IP addresses there. Clean those up.

                   

                  I don't even know how to edit that .conf file. I have to look into it.

                   

                  I have a question. Maybe you can help me with this:

                   

                  I have the uninstaller utility from the portal and I would like to uninstall the agent from a remote computer, but I am unable to do so. LEM support said that it's a UAC (User Account Control) issue with Windows 7 and 2008 and that I have to do it manually. This is ridiculous.

              • Re: LEM agent question
                nicole pauls

                The first time the spop.conf is created (when installing the agent), it is populated from the values you provide in the installer prompts.

                 

                If your agent is installed in an image, it's probably been saved with the image, and you need to update the image's spop.conf (or create a new image).

                 

                The certificate files are only created AFTER first connect between the LEM appliance and LEM agent, so if that has never happened, the first spop.conf is created by first install of the agent.

                  • Re: LEM agent question
                    evanr

                    So since our recent upgrade to 5.6.0 we have been having some issues with agents coming online then dropping off.

                     

                    (Fri Jul 05 11:14:52 CDT 2013) II:NOTICE [NioCenter v23873] {ComModuleSpop:21} Initializing Nio Center.;

                    (Fri Jul 05 11:14:52 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:1050} Nio Center successfully initialized.;

                    (Fri Jul 05 11:14:52 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:1050} Nio Center Connecting.;

                    (Fri Jul 05 11:15:13 CDT 2013) EE:ERR [NioSelector v23873] {NioComNetworkParent:1050} Connection status: Unable to complete nio  connection to address x.x.x.x/37892  Connection timed out: no further information;

                    (Fri Jul 05 11:15:13 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:1050} Closing Nio Center: Nio Thread has stopped;

                    (Fri Jul 05 11:15:13 CDT 2013) II:NOTICE [EncryptRouter v23873] {NioEncryptRouter:1053} EncryptRouter exiting;

                    (Fri Jul 05 11:15:13 CDT 2013) II:NOTICE [Communications] {ComModuleSpop:21} Parent disconnection signaled: ( id:0 ):  Disconnect reason: Unable to initialize;

                    (Fri Jul 05 11:15:13 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:21} niocenter is null;

                    (Fri Jul 05 11:15:13 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:21} Closed connection to manager: Unable to initialize;

                    (Fri Jul 05 11:15:13 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:21} Failed opening nio connection to manager 'x.x.x.x';

                    (Fri Jul 05 11:15:14 CDT 2013) II:NOTICE [DecryptRouter v23873] {NioDecryptRouter:1052} DecryptRouter exiting;

                    (Fri Jul 05 11:15:14 CDT 2013) II:NOTICE [LogManagementRegistryClient v23873] {ComModuleSpop:21} parentDisconnected;

                     

                    I have stopped the SolarWinds Log and Event Manager Agent.  Deleted the spop folder in the ContegoSPOP folder.  Verified the .conf file is either using the DNS name or correct IP.  Started the agent back up.  It will pop up in the Nodes list but never fully connects.  Any other suggestions other than completely removing the agent and re-installing it?

                     

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [Contego] {SPOP:8} Initializing database;

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [Contego] {SPOP:8} Database Initialized;

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [Contego] {Initialize Communications:10} Initializing Agent communications;

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [Contego] {Initialize Tools:13} Initializing ToolAPI;

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [Contego] {Initialize Tools:13} Initializing FAST;

                    (Fri Jul 05 11:41:59 CDT 2013) WW:STATUS [Communications] Operating System == Windows Server 2008;6.0;x86

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {Initialize Communications:10} CheckUSBDefender returned installed and running;

                    (Fri Jul 05 11:41:59 CDT 2013) DD:DEBUG 1 [Communications] Max number of agent install attempt property is not a numerical value, default to 10

                    (Fri Jul 05 11:41:59 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Making install request to: slem;

                    (Fri Jul 05 11:42:02 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Install request completed (favorably);

                    (Fri Jul 05 11:42:02 CDT 2013) WW:STATUS [Communications] This entity has got its certificate signed by the parent.

                    (Fri Jul 05 11:42:02 CDT 2013) WW:STATUS [ComModule] Opening installed connection to parent.

                    (Fri Jul 05 11:42:02 CDT 2013) II:NOTICE [NioCenter v23873] {ComModuleSpop:20} Initializing Nio Center.;

                    (Fri Jul 05 11:42:02 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:21} Nio Center successfully initialized.;

                    (Fri Jul 05 11:42:02 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:21} Nio Center Connecting.;

                    (Fri Jul 05 11:42:23 CDT 2013) EE:ERR [NioSelector v23873] {NioComNetworkParent:21} Connection status: Unable to complete nio  connection to address slem/37892  Connection timed out: no further information;

                    (Fri Jul 05 11:42:23 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:21} Closing Nio Center: Nio Thread has stopped;

                    (Fri Jul 05 11:42:23 CDT 2013) II:NOTICE [DecryptRouter v23873] {NioDecryptRouter:23} DecryptRouter exiting;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [EncryptRouter v23873] {NioEncryptRouter:24} EncryptRouter exiting;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [Communications] {ComModuleSpop:20} Parent disconnection signaled: ( id:0 ):  Disconnect reason: Unable to initialize;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} niocenter is null;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Closed connection to manager: Unable to initialize;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Failed opening nio connection to manager 'slem';

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [Contego] {Initialize RCC Server:12} Initializing the Tool Message Center;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [Contego] {Initialize Secret Center:11} Initializing the secret center;

                    (Fri Jul 05 11:42:24 CDT 2013) II:NOTICE [Contego] {Initialize RCC Server:12} Initializing the Tool API Command Center;

                    (Fri Jul 05 11:42:24 CDT 2013) WW:STATUS [FastEvaluator] Added ToolId:NtSystem

                    (Fri Jul 05 11:42:24 CDT 2013) WW:STATUS [FastEvaluator] Added ToolId:NT Application

                    (Fri Jul 05 11:42:24 CDT 2013) WW:STATUS [FastEvaluator] Added ToolId:VistaSecurity

                    (Fri Jul 05 11:42:25 CDT 2013) DD:DEBUG 2 [ComModuleSpop] Entering new stuck message into hash: 800

                    (Fri Jul 05 11:42:25 CDT 2013) II:NOTICE [LogManagementRegistryClient v23873] {ComModuleSpop:20} parentDisconnected;

                    (Fri Jul 05 11:42:38 CDT 2013) WW:WARNING [BuffBytesOneReaderOneWriter v24761] {pool-1-thread-1:35}  CommDataQueue 35 Queue file disk space has exceeded 10240 KBs which is the maximum allowed;

                    (Fri Jul 05 11:42:38 CDT 2013) WW:WARNING [BuffBytesOneReaderOneWriter v24761] {pool-1-thread-1:35}  CommDataQueue 35 Buffer dump to queue file cancelled;

                    (Fri Jul 05 11:42:38 CDT 2013) WW:STATUS [GC] Mem free before: 1644896 Mem free after: 12636160 mem total: 21528576

                    (Fri Jul 05 11:42:55 CDT 2013) II:NOTICE [NioCenter v23873] {ComModuleSpop:20} Initializing Nio Center.;

                    (Fri Jul 05 11:42:55 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:37} Nio Center successfully initialized.;

                    (Fri Jul 05 11:42:55 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:37} Nio Center Connecting.;

                    (Fri Jul 05 11:43:16 CDT 2013) EE:ERR [NioSelector v23873] {NioComNetworkParent:37} Connection status: Unable to complete nio  connection to address slem/37892  Connection timed out: no further information;

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:37} Closing Nio Center: Nio Thread has stopped;

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [Communications] {ComModuleSpop:20} Parent disconnection signaled: ( id:0 ):  Disconnect reason: Unable to initialize;

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} niocenter is null;

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Closed connection to manager: Unable to initialize;

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Failed opening nio connection to manager 'slem';

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [EncryptRouter v23873] {NioEncryptRouter:40} EncryptRouter exiting;

                    (Fri Jul 05 11:43:16 CDT 2013) II:NOTICE [DecryptRouter v23873] {NioDecryptRouter:39} DecryptRouter exiting;

                    (Fri Jul 05 11:43:17 CDT 2013) II:NOTICE [LogManagementRegistryClient v23873] {ComModuleSpop:20} parentDisconnected;

                    (Fri Jul 05 11:44:21 CDT 2013) II:NOTICE [NioCenter v23873] {ComModuleSpop:20} Initializing Nio Center.;

                    (Fri Jul 05 11:44:21 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:41} Nio Center successfully initialized.;

                    (Fri Jul 05 11:44:21 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:41} Nio Center Connecting.;

                    (Fri Jul 05 11:44:42 CDT 2013) EE:ERR [NioSelector v23873] {NioComNetworkParent:41} Connection status: Unable to complete nio  connection to address slem/37892  Connection timed out: no further information;

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:41} Closing Nio Center: Nio Thread has stopped;

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [EncryptRouter v23873] {NioEncryptRouter:44} EncryptRouter exiting;

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [Communications] {ComModuleSpop:20} Parent disconnection signaled: ( id:0 ):  Disconnect reason: Unable to initialize;

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} niocenter is null;

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Closed connection to manager: Unable to initialize;

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Failed opening nio connection to manager 'slem';

                    (Fri Jul 05 11:44:42 CDT 2013) II:NOTICE [DecryptRouter v23873] {NioDecryptRouter:43} DecryptRouter exiting;

                    (Fri Jul 05 11:44:43 CDT 2013) II:NOTICE [LogManagementRegistryClient v23873] {ComModuleSpop:20} parentDisconnected;
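Logs like the ones above get long quickly, so a small summarizer can help show how often each target times out. The sketch below is based only on the line shapes visible in this thread (a parenthesized timestamp, then a level such as `II:NOTICE` or `EE:ERR`, and the "connection to address host/port  Connection timed out" message); anything that does not start with a timestamp, such as stack-trace lines, is skipped. The function name `summarize` is made up.

```python
import re
from collections import Counter

# "(Fri Jul 05 11:15:13 CDT 2013) EE:ERR [...] ..."
LINE_RE = re.compile(r"^\s*\((?P<ts>[^)]*)\)\s+(?P<level>[A-Z]{2}:[A-Z]+)")
# "... connection to address slem/37892  Connection timed out ..."
TIMEOUT_RE = re.compile(
    r"connection to address (?P<host>[^\s/]+)/(?P<port>\d+)\s+Connection timed out"
)


def summarize(lines):
    """Count log lines per level and connection timeouts per (host, port)."""
    levels = Counter()
    timeouts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue  # blank lines and stack-trace continuation lines
        levels[match.group("level")] += 1
        hit = TIMEOUT_RE.search(line)
        if hit:
            timeouts[(hit.group("host"), int(hit.group("port")))] += 1
    return levels, timeouts
```

Feeding it the open log file (`summarize(open(path))`, with `path` pointing at wherever the agent log lives on that machine) returns two counters that can be printed or compared between a working and a failing agent.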

                      • Re: LEM agent question
                        antioch

                        Did you use the local or the remote installer? I had a huge issue with the local installer when I first started using LEM. It seems to be incredibly unstable and constantly drops and loses connection; my log looked exactly like yours. If you haven't already, I would recommend using the remote installer to roll the agent out to any and all PCs that need it. Even if it's just the local host, you can still use the remote agent installer, and it should clear up any issues you have.

                    • Re: LEM agent question
                      evanr

                      Well I figured I would try one more time before I open a support ticket.  We still have 3 or 4 agent machines that were connected to LEM at one time but for whatever reason refuse to re-establish the connection again.  I have tried the usual stopping the service, deleting the spop folder, and starting the service.  I tried uninstalling and re-installing the agent via the local tool.  Tried removing the agent then deploying via the remote agent.  So far nothing.  Still the same errors.  I'm able to telnet to the LEM box on the necessary ports in the .conf file, using the IP and DNS name.  Any ideas? Here is the log output. 

                       

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [Contego] {SPOP:8} Starting TriGeo Agent (Release 5.3.1) build [1];

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [SpopModule v24798] {SPOP:8} build server version string: 5.3.1-CORE_132.25174.51;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [InDepthConfigProps v24744] {SPOP:8} nDepth enabled via default because InDepthEnable not present;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [InDepthConfigProps v24744] {SPOP:8} indepth.conf not found at C:\WINDOWS\SysWOW64\ContegoSPOP\indepth.conf;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [RawDataClient v24744] {SPOP:8} Status Inactive;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [Contego] {SPOP:8} Initializing database;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [Contego] {SPOP:8} Database Initialized;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [Contego] {Initialize Communications:10} Initializing Agent communications;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [Contego] {Initialize Tools:13} Initializing ToolAPI;

                      (Fri Jul 26 08:56:36 CDT 2013) II:NOTICE [Contego] {Initialize Tools:13} Initializing FAST;

                      (Fri Jul 26 08:56:36 CDT 2013) WW:STATUS [Communications] Operating System == Windows Server 2008;6.0;x86

                      (Fri Jul 26 08:56:40 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {Initialize Communications:10} CheckUSBDefender returned installed and running;

                      (Fri Jul 26 08:56:40 CDT 2013) DD:DEBUG 1 [Communications] Max number of agent install attempt property is not a numerical value, default to 10

                      (Fri Jul 26 08:56:40 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Making install request to: 172.20.0.167;

                      (Fri Jul 26 08:56:41 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Install request completed (favorably);

                      (Fri Jul 26 08:56:41 CDT 2013) WW:STATUS [Communications] This entity has got its certificate signed by the parent.

                      (Fri Jul 26 08:56:41 CDT 2013) WW:STATUS [ComModule] Opening installed connection to parent.

                      (Fri Jul 26 08:56:41 CDT 2013) II:NOTICE [NioCenter v23873] {ComModuleSpop:20} Initializing Nio Center.;

                      (Fri Jul 26 08:56:41 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:21} Nio Center successfully initialized.;

                      (Fri Jul 26 08:56:41 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:21} Nio Center Connecting.;

                      (Fri Jul 26 08:57:02 CDT 2013) EE:ERR [NioSelector v23873] {NioComNetworkParent:21} Connection status: Unable to complete nio  connection to address 172.20.0.167/37892  Connection timed out: no further information;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [NioCenter v23873] {NioComNetworkParent:21} Closing Nio Center: Nio Thread has stopped;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [DecryptRouter v23873] {NioDecryptRouter:23} DecryptRouter exiting;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [EncryptRouter v23873] {NioEncryptRouter:24} EncryptRouter exiting;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [Communications] {ComModuleSpop:20} Parent disconnection signaled: ( id:0 ):  Disconnect reason: Unable to initialize;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} niocenter is null;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Closed connection to manager: Unable to initialize;

                      (Fri Jul 26 08:57:02 CDT 2013) II:NOTICE [NioComNetworkParent v24745] {ComModuleSpop:20} Failed opening nio connection to manager '172.20.0.167';

                      (Fri Jul 26 08:57:03 CDT 2013) II:NOTICE [Contego] {Initialize RCC Server:12} Initializing the Tool Message Center;

                      (Fri Jul 26 08:57:03 CDT 2013) II:NOTICE [Contego] {Initialize Secret Center:11} Initializing the secret center;

                      (Fri Jul 26 08:57:03 CDT 2013) II:NOTICE [Contego] {Initialize RCC Server:12} Initializing the Tool API Command Center;

                      (Fri Jul 26 08:57:03 CDT 2013) WW:STATUS [FastEvaluator] Added ToolId:NtSystem

                      (Fri Jul 26 08:57:03 CDT 2013) WW:STATUS [FastEvaluator] Added ToolId:NT Application

                      (Fri Jul 26 08:57:03 CDT 2013) WW:STATUS [FastEvaluator] Added ToolId:VistaSecurity

                      (Fri Jul 26 08:57:03 CDT 2013) DD:DEBUG 2 [ComModuleSpop] Entering new stuck message into hash: 800

                      (Fri Jul 26 08:57:03 CDT 2013) II:NOTICE [LogManagementRegistryClient v23873] {ComModuleSpop:20} parentDisconnected;

                      • Re: LEM agent question
                        evanr

                        I hate to beat a dead horse here but I upped the log output to something other than the default 3.  This particular machine just will not connect.   I can telnet to the ports fine from this particular machine, and other servers that sit on its LAN are able to hit our LEM device w/o issue.

                         

                        I get two consistent error messages.  One where the client can't seem to establish and keep the connection.

                         

                        (Fri Aug 30 15:24:37 CDT 2013) DD:DEBUG MODE [NioSelector v23873] {NioComNetworkParent:52} Connecting to server using address: '172.20.0.167'  and port: '37892';

                        (Fri Aug 30 15:24:37 CDT 2013) DD:DEBUG [NioSelector v23873] {NioComNetworkParent:52} Successful binding to port: 37893;

                        (Fri Aug 30 15:24:38 CDT 2013) DD:DEBUG MODE [NioComNetworkParent v24745] {BBS:DequeueToComm-1:28} Waiting on trigeoauth before sending message.;

                        (Fri Aug 30 15:24:38 CDT 2013) DD:DEBUG MODE [NioComNetworkParent v24745] {BBS:DequeueToComm-1:28} Setting kill timer for: sendPacketViaDataChannel;

                        (Fri Aug 30 15:24:45 CDT 2013) DD:DEBUG MODE [NioComNetworkParent v24745] {Timer-2:25} Waiting on trigeoauth before sending message.;

                        (Fri Aug 30 15:24:45 CDT 2013) DD:DEBUG MODE [NioComNetworkParent v24745] {Timer-2:25} Setting kill timer for: sendPacketViaDataChannel;

                        (Fri Aug 30 15:24:59 CDT 2013) EE:ERR [NioSelector v23873] {NioComNetworkParent:52} Connection status: Unable to complete nio  connection to address 172.20.0.167/37892  Connection timed out: no further information;

                        (Fri Aug 30 15:24:59 CDT 2013) DD:DEBUG MODE [NioSelector v23873] {NioComNetworkParent:52} EXCEPTION: java.net.ConnectException: Connection timed out: no further information

                          at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)

                          at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source)

                          at com.trigeo.core.communications.nio.client.NioSelectorOnClient.initiateConnection(NioSelectorOnClient.java:200)

                          at com.trigeo.core.communications.nio.NioCenter.connect(NioCenter.java:329)

                          at com.trigeo.core.communications.nio.NioCenter.run(NioCenter.java:296)

                          at java.lang.Thread.run(Unknown Source)

                          at com.trigeo.util.TriGeoThread.run(TriGeoThread.java:57)

                         

                        The second is where the initial request is made from the agent to the appliance but for whatever reason the data stream is broken.  I can see the connection made on the LEM device but it will eventually drop.

                         

                        $ netstat -ano | grep 172.22.0.41

                        tcp        0      0 172.20.0.167:37890      172.22.0.41:37893       ESTABLISHED off (0.00/0/0)

                         

                        (Fri Aug 30 15:30:04 CDT 2013) DD:DEBUG MODE [NioComNetworkParent v24745] {ComModuleSpop:20} bound to local port: 37893;

                        (Fri Aug 30 15:31:04 CDT 2013) EE:ERR [NioComNetworkParent v24745] {ComModuleSpop:20} EXCEPTION: java.io.EOFException

                          at java.io.ObjectInputStream$BlockDataInputStream.peekByte(Unknown Source)

                          at java.io.ObjectInputStream.readObject0(Unknown Source)

                          at java.io.ObjectInputStream.readObject(Unknown Source)

                          at com.trigeo.core.communications.common.ComNetworkParent.writeMessageToCommandChannel(ComNetworkParent.java:1217)

                          at com.trigeo.core.communications.common.ComNetworkParent.sendParentViaCommandChannelForResponse(ComNetworkParent.java:328)

                          at com.trigeo.core.communications.common.ComNetworkParent.installRequest(ComNetworkParent.java:247)

                          at com.trigeo.core.communications.nio.client.NioComNetworkParent.installRequest(NioComNetworkParent.java:107)

                          at com.trigeo.core.communications.common.ComModule.autoInstall(ComModule.java:550)

                          at com.trigeo.core.communications.common.ComModule.setUp(ComModule.java:364)

                          at com.trigeo.core.communications.spop.ComModuleSpop.run(ComModuleSpop.java:172)

                          at java.lang.Thread.run(Unknown Source)

                          at com.trigeo.util.TriGeoThread.run(TriGeoThread.java:57)

                         

                        Is there something Java-related that I'm missing, perhaps?  Another strange thing I noticed was that the CommDataQueue will fill up with about 320 files, each only about 32 KB, for a total of 9.9 MB, but it seems to puke as well when attempting to send the queued data.

                         

                        (Fri Aug 30 15:15:58 CDT 2013) WW:WARNING [BuffBytesOneReaderOneWriter v24761] {pool-1-thread-1:38}  CommDataQueue 38 Queue file disk space has exceeded 10240 KBs which is the maximum allowed;

                        (Fri Aug 30 15:15:58 CDT 2013) WW:WARNING [BuffBytesOneReaderOneWriter v24761] {pool-1-thread-1:38}  CommDataQueue 38 Buffer dump to queue file cancelled;

                         

                        I'm getting tired of having to uninstall, trash the files from the registry, reboot, and re-install the agent on this machine. 
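On the CommDataQueue numbers: roughly 320 files of about 32 KB each comes to about 320 × 32 = 10240 KB, which lines up with the 10240 KB maximum quoted in the warning, so the queue appears to be sitting at its cap rather than failing at some arbitrary point. A minimal sketch for checking that on disk follows. The queue directory location is not given in the thread, so it is a parameter here, and the function names are hypothetical.

```python
from pathlib import Path

QUEUE_LIMIT_KB = 10240  # limit quoted in the agent's warning message


def queue_usage_kb(queue_dir):
    """Total size, in KB, of the regular files directly inside queue_dir."""
    total_bytes = sum(
        p.stat().st_size for p in Path(queue_dir).iterdir() if p.is_file()
    )
    return total_bytes / 1024


def is_queue_full(queue_dir, limit_kb=QUEUE_LIMIT_KB):
    """True when the queue files have reached the limit (assumed inclusive)."""
    return queue_usage_kb(queue_dir) >= limit_kb
```

Whether the agent treats the limit as inclusive is a guess; the warning text only says the space "has exceeded" the maximum.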

                        • Re: LEM agent question
                          jwhite@zoll.com

                          I am also having this issue.  Have you found the resolution?

                            • Re: LEM agent question
                              evanr

                              Still getting it.  It seems the javaw process sends the SYN (the socket shows SYN_SENT), and a deeper packet trace shows the three-way handshake.  But for some reason the agent machine then sends a RST, and that is all she wrote.  I suspect there may be an underlying network issue as these machines are in a separate offsite data center that has a site-to-site with our production location.  There have been some latency/packet loss issues as of late so I doubt it's an issue with the agent itself.
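If the suspicion is an intermittent network problem, a single telnet test can easily pass by luck. One rough way to look for flakiness is to repeat the connection attempt and tally the outcomes, as in the sketch below. It only exercises connection setup, so it would not reveal a reset sent after the handshake; that still needs a packet capture. The function name `probe` and the outcome labels are made up for illustration.

```python
import socket
import time
from collections import Counter


def probe(host, port, attempts=10, timeout=5.0, pause=1.0):
    """Try to connect `attempts` times and count how each attempt ended."""
    outcomes = Counter()
    for i in range(attempts):
        try:
            with socket.create_connection((host, port), timeout=timeout):
                outcomes["connected"] += 1
        except (socket.timeout, TimeoutError):
            outcomes["timed_out"] += 1
        except ConnectionRefusedError:
            outcomes["refused"] += 1
        except OSError:
            outcomes["other_error"] += 1
        if i < attempts - 1:
            time.sleep(pause)  # space the attempts out a little
    return outcomes
```

Running it from an affected agent machine against the appliance on port 37892, and again from a machine on the same LAN as the appliance, would give a simple side-by-side of timeouts versus successful connections.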

                            • Re: LEM agent question
                              aigidm

                              No one has answered... this is fooling the customers. Where is the answer from a SolarWinds technical person? I even created a ticket, and it has been pending resolution for more than 10 days. No luck.

                              • Re: LEM agent question
                                aigidm

                                Please do not mark this as "This question is Assumed Answered", because I do not find any valuable answers in this thread. Thanks. My open case ID is 559648.

                                  • Re: LEM agent question
                                    curtisi

                                    I pulled that case, and we were told by the customer to close it about 6 hours ago.

                                     

                                    Also, @evanr did provide the answer for his version of the issue: "I suspect there may be an underlying network issue as these machines are in a separate offsite data center that has a site-to-site with our production location.  There have been some latency/packet loss issues as of late so I doubt it's an issue with the agent itself."

                                     

                                    Have you investigated network latency and topology as potential problems with your scenario?

                                    • Re: LEM agent question
                                      aigidm

                                      Sorry for the typo....  Case #559643 - "Node is showing as disconnected state."