SNMP bulkwalk intermittently fails with "OID not increasing" for TCP-MIB::tcpListenerProcess

Question

Hello, I am investigating an SNMP issue observed in a production environment where snmpbulkwalk intermittently fails with the following error: Error: OID not increasing TCP-MIB::tcpListenerProcess.ipv4."127.0.0.1".53 This occurs while walking the TCP listener table (.1.3.6.1.2.1.6.20). Environment: * OS: AlmaLinux * Kernel: * net-snmp: Test setup: * Configured SNMP with full MIB access (view all included .1) * Generated multiple TCP listeners (via nc and local services) * snmpbulkwalk -v2c -c localhost TCP-MIB::tcpListenerProcess Observed behavior (test environment): * SNMP walk completes successfully * Output appears correctly ordered * Unable to reproduce the error Observed behavior (production): * Error: OID not increasing * Appears to be tied specifically to tcpListenerProcess Workaround applied: Excluding the subtree resolves the issue: view limited included .1.3.6.1.2.1 view limited excluded .1.3.6.1.2.1.6.20 Questions: * Under what conditions can tcpListenerProcess return non-monotonic or duplicate OIDs? * Is this a known limitation/bug in net-snmp when walking kernel TCP tables? * Could this be caused by race conditions while reading /proc/net/tcp or similar kernel interfaces? * Are there known kernel versions or high socket churn scenarios that trigger this behavior? * Is there a recommended way to reliably reproduce this issue for testing? Additional notes: * The issue appears intermittent and environment-specific * Suspected interaction between SNMP walk timing and dynamic TCP socket state * Reboot temporarily resolved the issue in production Any insights into root cause or reproduction strategies would be greatly appreciated. Thanks.

muhammad.osama · Answer

1. Under what conditions can tcpListenerProcess return non-monotonic OIDs? In net-snmp, the tcpListenerTable (.1.3.6.1.2.1.6.20) is populated by scanning the kernel's current socket list. * Dynamic Re-sorting: If a TCP listener is closed and a new one is opened between PDU requests during a bulkwalk, the index order changes. * The GetBulk Leap: snmpbulkwalk asks for multiple OIDs at once (the max-repetitions value). If the underlying kernel data structure shifts while the agent is iterating through the requested batch, it may repeat an OID or return one that was previously "behind" it in the list. * Duplicate Indexing: If the kernel reporting mechanism (like netlink or /proc) momentarily shows transient states, net-snmp might map two different internal socket structures to the same OID index. 2. Is this a known limitation/bug in net-snmp? Yes. It is a known architectural challenge rather than a simple "patchable" bug.The tcpListenerTable is indexed by tcpListenerAddrType, tcpListenerAddr, and tcpListenerPort. Because the Linux kernel does not provide a "snapshot" of the socket table to userspace, net-snmp must iterate through it. If the table is large or changing rapidly, the iteration becomes inconsistent. This is documented in various net-snmp bug trackers relating to the TCP-MIB and UDP-MIB implementations. 3. Race conditions and kernel interfaces The net-snmp agent typically uses Netlink (specifically RTM_GETDIAG) or reads /proc/net/tcp to build this table. * Netlink: While faster, it still provides a point-in-time stream. High socket churn (services restarting, load balancers cycling) causes the kernel to update its internal buckets. * Atomicity: There is no "locking" of the socket table for the SNMP agent. If snmpd is halfway through reading the table and a new socket is inserted at the "top" (a lower port number), the next "GetNext" request might loop back to that lower OID. 4. Trigger Scenarios (High Socket Churn) You are likely seeing this in production but not in test because of socket churn and table size. * Ephemeral Port Exhaustion: If the production server handles thousands of short-lived connections, the kernel tables are under constant flux. * Microservices/Containers: On AlmaLinux, if you are running many containers (Podman/Docker), the tcpListenerTable can become very large, increasing the time snmpd spends processing the walk and widening the "window of failure" for a race condition. * Kernel Versions: Newer kernels (5.x+) used in AlmaLinux are more efficient with Netlink, but the fundamental lack of a table "snapshot" for SNMP remains. 5. Recommended Reproduction Strategy To reproduce this in your test environment, you must simulate high-frequency socket cycling while performing the walk. * Bashwhile true; do PORT=$(( ( RANDOM % 1000 ) + 1000 )); timeout 0.1s nc -l $PORT; done * Bashsnmpbulkwalk -v2c -c -Cr50 localhost .1.3.6.1.2.1.6.20 * Simulate Latency: Use tc (traffic control) to add slight jitter to the loopback interface to desync the SNMP request processing from the kernel table reads. Recommended Permanent Solution If excluding the OID is not acceptable, the most robust fix is to use the snmpgetnext (v1 style) or force a max-repetitions of 1. * Command-line fix: snmpwalk -v2c -Cc ... (The -Cc flag tells snmpwalk to ignore OID ordering errors and continue). * Configuration fix: In snmpd.conf, you can use the cacheTime setting for certain MIB modules to force net-snmp to cache the table for a few seconds rather than re-reading the kernel for every single PDU in the walk. Bash # Example for snmpd.conf to increase cache (if supported by your net-snmp build)# This prevents the agent from hitting the kernel mid-walkmfd_skip_cache_timeout 1

muhammad.osama · Answer

1. Under what conditions can tcpListenerProcess return non-monotonic OIDs? In net-snmp, the tcpListenerTable (.1.3.6.1.2.1.6.20) is populated by scanning the kernel's current socket list. * Dynamic Re-sorting: If a TCP listener is closed and a new one is opened between PDU requests during a bulkwalk, the index order changes. * The GetBulk Leap: snmpbulkwalk asks for multiple OIDs at once (the max-repetitions value). If the underlying kernel data structure shifts while the agent is iterating through the requested batch, it may repeat an OID or return one that was previously "behind" it in the list. * Duplicate Indexing: If the kernel reporting mechanism (like netlink or /proc) momentarily shows transient states, net-snmp might map two different internal socket structures to the same OID index. 2. Is this a known limitation/bug in net-snmp? Yes. It is a known architectural challenge rather than a simple "patchable" bug.The tcpListenerTable is indexed by tcpListenerAddrType, tcpListenerAddr, and tcpListenerPort. Because the Linux kernel does not provide a "snapshot" of the socket table to userspace, net-snmp must iterate through it. If the table is large or changing rapidly, the iteration becomes inconsistent. This is documented in various net-snmp bug trackers relating to the TCP-MIB and UDP-MIB implementations. 3. Race conditions and kernel interfaces The net-snmp agent typically uses Netlink (specifically RTM_GETDIAG) or reads /proc/net/tcp to build this table. * Netlink: While faster, it still provides a point-in-time stream. High socket churn (services restarting, load balancers cycling) causes the kernel to update its internal buckets. * Atomicity: There is no "locking" of the socket table for the SNMP agent. If snmpd is halfway through reading the table and a new socket is inserted at the "top" (a lower port number), the next "GetNext" request might loop back to that lower OID. 4. Trigger Scenarios (High Socket Churn) You are likely seeing this in production but not in test because of socket churn and table size. * Ephemeral Port Exhaustion: If the production server handles thousands of short-lived connections, the kernel tables are under constant flux. * Microservices/Containers: On AlmaLinux, if you are running many containers (Podman/Docker), the tcpListenerTable can become very large, increasing the time snmpd spends processing the walk and widening the "window of failure" for a race condition. * Kernel Versions: Newer kernels (5.x+) used in AlmaLinux are more efficient with Netlink, but the fundamental lack of a table "snapshot" for SNMP remains. 5. Recommended Reproduction Strategy To reproduce this in your test environment, you must simulate high-frequency socket cycling while performing the walk. * Bashwhile true; do PORT=$(( ( RANDOM % 1000 ) + 1000 )); timeout 0.1s nc -l $PORT; done * Bashsnmpbulkwalk -v2c -c -Cr50 localhost .1.3.6.1.2.1.6.20 * Simulate Latency: Use tc (traffic control) to add slight jitter to the loopback interface to desync the SNMP request processing from the kernel table reads. Recommended Permanent Solution If excluding the OID is not acceptable, the most robust fix is to use the snmpgetnext (v1 style) or force a max-repetitions of 1. * Command-line fix: snmpwalk -v2c -Cc ... (The -Cc flag tells snmpwalk to ignore OID ordering errors and continue). * Configuration fix: In snmpd.conf, you can use the cacheTime setting for certain MIB modules to force net-snmp to cache the table for a few seconds rather than re-reading the kernel for every single PDU in the walk. Bash # Example for snmpd.conf to increase cache (if supported by your net-snmp build)# This prevents the agent from hitting the kernel mid-walkmfd_skip_cache_timeout 1